CLEAVAGE AND POLYADENYLATION COMPLEX OF PRECURSOR MRNA



2. BACKGROUND OF THE INVENTION 

Polyadenylation of precursor mRNAs (pre-mRNAs) is an obligatory step in the maturation
of most eukaryotic transcripts. The addition of poly(A) (polyadenosine) tails promotes
transcription termination and export of the mRNA from the nucleus. Furthermore,
poly(A) tails increase the efficiency of translation initiation and help to
stabilize mRNAs. Polyadenylation occurs posttranscriptionally in the nucleus of
eukaryotic cells in two tightly coupled steps: the endonucleolytic cleavage of the
precursor and the addition of a poly(A) tail.

In the yeast Saccharomyces cerevisiae, the pre-mRNA 3'-end processing signals are not
as well conserved as in mammalian cells (see below). In addition to the cleavage and 
polyadenylation site, two cis-acting elements, called the efficiency element and the 
positioning element, are found upstream of the cleavage site. Efficiency elements contain 
the sequence UAUAUA (or close variants thereof) and are often repeated. The sequence 
AAUAAA and several related sequences can function as a positioning element. 
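
The element definitions above can be illustrated with a minimal sketch that scans an RNA sequence for the two exact motifs named in the text. The function names and the example sequence are hypothetical and serve illustration only; the "close variants" and "related sequences" mentioned above are not modelled.

```python
import re

# Motifs taken from the text; variants of either element are NOT modelled here.
EFFICIENCY_ELEMENT = "UAUAUA"
POSITIONING_ELEMENT = "AAUAAA"


def find_motif(rna, motif):
    """Return 0-based start positions of every (possibly overlapping) motif occurrence."""
    # A zero-width lookahead lets overlapping hits, e.g. in UAUAUAUA, be reported separately.
    pattern = "(?=" + re.escape(motif) + ")"
    return [m.start() for m in re.finditer(pattern, rna.upper())]


def scan_yeast_signals(rna):
    """Locate candidate efficiency and positioning elements in an RNA sequence."""
    return {
        "efficiency": find_motif(rna, EFFICIENCY_ELEMENT),
        "positioning": find_motif(rna, POSITIONING_ELEMENT),
    }


# Hypothetical 3'-UTR fragment, invented for illustration only.
example_utr = "GCUAUAUAUAGGCCAAUAAAGCUUCA"
print(scan_yeast_signals(example_utr))
```

Repeated efficiency elements, which the text describes as common, show up as several start positions in the "efficiency" list.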

Fractionation of yeast extracts led to the separation of protein factors that are required
for mRNA 3'-end formation in vitro. The cleavage reaction requires cleavage factors I and
II (CF I and CF II), whereas polyadenylation involves CF I, polyadenylation factor I (PF I)
and poly(A) polymerase (Pap1).

CF I can be separated into two activities, CF IA and CF IB. CF IA is needed for both
processing steps and is a heterotetrameric protein with subunits of 38, 50, 70 and 76
kDa that are encoded by the RNA15, CLP1, PCF11 and RNA14 genes. Rna14 shares
significant sequence similarity with the 77 kDa subunit of mammalian cleavage stimulation
factor (CstF), and Rna15 contains an RNA-binding domain homologous to that of the 64
kDa subunit of CstF.

In addition to the above-mentioned four CF I subunits, Pab1 (poly(A) binding protein) was
identified in purified CF I fractions. Both biochemical and genetic data indicate an
involvement of Pab1 in poly(A) length control. CF IB consists of a single protein called
Nab4/Hrp1 and is required for cleavage site selection and polyadenylation.

A multiprotein complex which has CF II-PF I (= CPF) activity consists of nine
polypeptides: Pap1 (poly(A) polymerase), Pta1, Pfs1, Pfs2, Fip1, Cft1/Yhh1, Cft2/Ydh1,
Ysh1/Brr5, and Yth1. Pap1, a 64 kDa protein, was the first component of the yeast 3'-end
formation complex to be purified to homogeneity. Pta1 is a 90 kDa protein which is
required for both cleavage and polyadenylation of mRNA precursors. Pfs2 is a 53 kDa
protein that contains seven WD40 repeats. Pfs2 has been shown to directly interact with
subunits of CF II-PF I and CF IA and is thought to function in the assembly and
stabilization of the 3'-end processing complex. Fip1 has been demonstrated to physically
interact with Pap1, Yth1 and Rna14, and it is believed to tether Pap1 to its substrate
during polyadenylation. Cft1/Yhh1, Cft2/Ydh1, Ysh1/Brr5, and Yth1 are the counterparts
of the four subunits of the mammalian cleavage and polyadenylation specificity factor,
CPSF160, CPSF100, CPSF73 and CPSF30, respectively.

Furthermore, TIF4632 has been found to interact with Pab1 (see Table 1).

For the mammalian system, the data presented so far provide evidence for a conserved
mechanism, but also reveal some differences between the yeast and the mammalian
structures.

The composition and function of the mammalian complex based on the data to date is as 
follows: 

The cleavage and polyadenylation specificity factor (CPSF) is composed of 4 subunits:

CPSF160 (involved in mRNA and poly(A) polymerase (PAP) binding), CPSF100,
CPSF73 and CPSF30 (involved in mRNA and PABII binding).

CPSF binds the AAUAAA hexanucleotide. CPSF links mRNA 3'-end processing to
transcription. CPSF exists as a stable complex with the transcription factor TFIID
complex. The 160 kDa subunit of CPSF binds to several hTAFIIs. TFIID recruits CPSF to
the RNA polymerase II pre-initiation complex. Upon transcriptional activation, CPSF
dissociates from TFIID and associates with the elongating RNA pol II (CTD, the
carboxy-terminal domain of the largest subunit of RNA polymerase II). CPSF is thought to
travel with RNA pol II until they reach the polyadenylation site, where CPSF can bind the
AAUAAA element. CPSF is required for the termination of transcription.

The interaction between CPSF and the AAUAAA element is weak and not very specific.
The binding of CPSF to the hexanucleotide is greatly enhanced by a second component of
the polyadenylation machinery, the cleavage stimulation factor (CstF), which binds the
G-U-rich motif. CstF also binds RNA pol II through its 50 kDa subunit (CstF50).
Furthermore, CstF50 binds another component of the transcriptional machinery, the
BRCA1-associated RING domain protein (BARD1). BARD1 also interacts with RNA pol II.
The BARD1-CstF50 interaction inhibits polyadenylation in vitro and may prevent
inappropriate mRNA processing during transcription. CstF is composed of 3 subunits:
CstF64 (binds mRNA and symplekin (yeast homolog: Pta1)), CstF77 (binds CPSF160,
CstF64, CstF50) and CstF50 (binds RNA pol II and BARD1). The co-operative binding of
CPSF and CstF to the polyadenylation site forms a ternary complex, which functions to
recruit the other components of the polyadenylation machinery to the cleavage site: the
two cleavage factors (CFIm and CFIIm) and the poly(A) polymerase (PAP).

CFIm is a heterodimer of 4 subunits (72, 68, 59 and 25 kDa). CFIIm has two components:
one essential, CFIImA, and one stimulatory, CFIImB. CFIImA contains hPcf11p and hClp1p
(binds CPSF and CFIm). CFIImB contains no factors previously shown to be involved in
3'-end processing and may be a new 3'-end processing factor. Although the identity of the
proteins that perform the cleavage step is still unknown, it is well established that both
CFIm and CFIIm are required. The reaction products of the cleavage suggest that a metal
ion is involved. Surprisingly, PAP (but not its catalytic activity) is required for the cleavage.

After the cleavage step, CstF, CFIm and CFIIm are dispensable. PAP bound to CPSF
(through its 160 kDa subunit) can start polyadenylating the cleaved 3'-end, but at that
step the process is very inefficient. The poly(A) binding protein II (PAB II) can bind the
nascent poly(A) chain as soon as it reaches a minimal length of 10 adenosines. PAB II also
interacts with CPSF30. The binding of PAB II greatly stabilizes PAP at the 3'-end of
the mRNA, supporting the processive synthesis of a long poly(A) tail. In the nucleus,
the length of the poly(A) tail is restricted to about 250 adenosines. This size restriction is
probably achieved through stoichiometric binding of multiple PAB II molecules. It is not yet
known how the incorporation of a certain amount of PAB II in the complex terminates
processive elongation.

CstF is part of the mammalian 3'-end processing complex and is a heterotrimeric protein
with subunits of 77, 64 and 50 kDa. CstF-50 has been shown to interact with the
BRCA1-associated protein BARD1, and this interaction suppresses the nuclear mRNA
polyadenylation machinery in vivo. In a recent study it was found that treatment of cells
with DNA damage-inducing agents causes a transient, but specific, inhibition of mRNA
3'-end processing in cell extracts. This inhibition reflects the BARD1/CstF interaction and
involves enhanced formation of a CstF/BARD1/BRCA1 complex. A tumor-associated
germline mutation in BARD1 decreases binding to CstF-50 and renders the protein 
inactive in polyadenylation inhibition. These results support the existence of a link 
between mRNA 3'-end formation and DNA repair/tumor suppression. The in vivo function 
of these interactions may be to inhibit the cleavage and polyadenylation of pre-mRNAs 
on polymerase molecules that are stalled at sites of DNA repair. 

Cleavage stimulation factor (CstF) is one of the multiple factors required for mRNA
polyadenylation in mammals. CstF-64 may play a role in regulating gene expression
and cell growth in B cells. The concentration of one CstF subunit (CstF-64) increases
during activation of B cells, and this is sufficient to switch IgM heavy chain mRNA
expression from the membrane-bound to the secreted form. Reduction in CstF-64 causes
reversible cell cycle arrest in G0/G1 phase, while depletion results in apoptotic cell death.

In contrast to what is observed in yeast, the sequence elements in mammals, which 
specify the site of cleavage and polyadenylation, flank the site of endonucleolytic attack. 
One is the hexanucleotide AAUAAA found 10-30 bases upstream of the 
cleavage/polyadenylation site. The second is a G-U-rich motif located 20-40 bases 
downstream of the cleavage/polyadenylation site. These two elements and their spacing 
determine the site of cleavage/polyadenylation and also the strength of the 
polyadenylation signal.

Some other elements, such as sequences upstream of the AAUAAA (upstream sequence
elements, USEs), play regulatory roles.
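
The spacing rules for the mammalian elements can be sketched as a simple positional check. This is only an illustration under stated assumptions: the text gives the distance ranges (10-30 bases upstream, 20-40 bases downstream) but does not define how distances are measured or what counts as "G-U-rich", so the distance convention, the window size of 8 and the 75% G/U threshold below are hypothetical choices, as are all function names.

```python
HEXAMER = "AAUAAA"


def has_upstream_hexamer(rna, cleavage_pos, min_dist=10, max_dist=30):
    """True if an AAUAAA ends min_dist..max_dist bases upstream of cleavage_pos."""
    for start in range(len(rna) - len(HEXAMER) + 1):
        if rna[start:start + len(HEXAMER)] == HEXAMER:
            # Assumption: distance runs from the base after the hexamer to the cleavage site.
            dist = cleavage_pos - (start + len(HEXAMER))
            if min_dist <= dist <= max_dist:
                return True
    return False


def has_downstream_gu_rich(rna, cleavage_pos, min_dist=20, max_dist=40,
                           window=8, min_fraction=0.75):
    """True if a window starting min_dist..max_dist bases downstream is mostly G/U."""
    for dist in range(min_dist, max_dist + 1):
        segment = rna[cleavage_pos + dist:cleavage_pos + dist + window]
        if len(segment) < window:
            break  # ran off the end of the sequence; later windows are shorter still
        gu_count = sum(base in "GU" for base in segment)
        if gu_count / window >= min_fraction:
            return True
    return False


def is_candidate_site(rna, cleavage_pos):
    """Check both flanking elements around a putative cleavage position (0-based)."""
    rna = rna.upper()
    return (has_upstream_hexamer(rna, cleavage_pos)
            and has_downstream_gu_rich(rna, cleavage_pos))


# Synthetic sequence, invented for illustration: hexamer, 15-base spacer,
# putative cleavage site at index 26, then a G-U-rich stretch 25 bases downstream.
synthetic = "C" * 5 + "AAUAAA" + "C" * 15 + "C" * 25 + "GUGUGUGU" + "C" * 5
print(is_candidate_site(synthetic, 26))
```

Such a check captures only the presence and spacing of the two elements; it does not attempt to score the strength of the polyadenylation signal.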

A schematic presentation of the motifs underlying mammalian polyadenylation and yeast
polyadenylation is shown in Fig. 1. A review on the formation of mRNA 3'-ends in
eukaryotes is given in Zhao, Hyman and Moore in Microbiology and Molecular Biology 
Reviews, 1999, pp. 405-445. A comparison of mammalian and yeast pre-mRNA 3'-end 
processing is also given in Keller and Minvielle-Sebastia in Nucleus and gene expression 
in Current Opinion in Cell Biology, 1997, Vol. 9, pp. 329-336. 

There are diseases which involve defects in the function of the polyadenylation 
machinery. 

Many viruses interact directly with components of the mRNA processing machinery. 

The herpes simplex virus type 1 (HSV-1) immediate early (alpha) protein ICP27 is an 
essential regulatory protein that is involved in the shutoff of host protein synthesis. It
affects mRNA processing at the level of both polyadenylation and splicing. During 
polyadenylation, ICP27 appears to stimulate 3' mRNA processing at selected poly(A) 
sites. The opposite effect occurs on host cell splicing. That is, during HSV-1 infection, an 
inhibition in host cell splicing requires ICP27 expression. This contributes to the shutoff of 
host protein synthesis by decreasing levels of spliced cellular mRNAs available for 
translation. A redistribution of splicing factors regulated by ICP27 has also been seen. 

Epstein-Barr virus BMLF1 gene product EB2 seems to affect mRNA nuclear export of 
intronless mRNAs and pre-mRNA 3' processing. EB2 contains an Arg-X-Pro tripeptide 
repeated eight times, similar to that described as an mRNA-binding domain in the herpes 
simplex virus type 1 protein US11.

Interestingly, both viruses have been found to precede the onset of lymphomas. 

Influenza A virus NS1A protein binds the 30 kDa subunit of the cleavage and
polyadenylation specificity factor (CPSF). In addition, the NS1 protein (NS1A protein)
targets, via its effector domain, the poly(A)-binding protein II (PABII) of the cellular
3'-end processing machinery. In vitro, the NS1A protein binds the PABII protein, and in
vivo it causes PABII protein molecules to relocalize from nuclear speckles to a uniform
distribution throughout the nucleoplasm. In vitro, the NS1A protein inhibits the ability of
PABII to stimulate the processive synthesis of long poly(A) tails catalyzed by poly(A)
polymerase (PAP). Such inhibition also occurs in vivo in influenza virus-infected cells.
Consequently, although the NS1A protein also binds the 30 kDa subunit of the cleavage
and polyadenylation specificity factor (CPSF), 3' cleavage of some cellular pre-mRNAs
still occurs in virus-infected cells, followed by the PAP-catalyzed addition of short poly(A)
tails. Subsequent elongation of these short poly(A) tails is blocked because the NS1A
protein inhibits PABII function. The NS1 effector domain functionally interacts with the
cellular 30 kDa subunit of CPSF, an essential component of the 3'-end processing
machinery of cellular pre-mRNAs.

Metachromatic leukodystrophy (MLD) is a lysosomal storage disorder caused by the 
deficiency of arylsulfatase A (ASA). A substantial ASA deficiency has also been 
described in clinically healthy persons, a condition for which the term pseudodeficiency 
was introduced. The mutations characteristic for the pseudodeficiency (PD) allele have 
been identified. Sequence analysis revealed two A-G transitions. One of them changes 
the first polyadenylation signal downstream of the stop codon from AATAAC to AGTAAC. 
This causes a severe deficiency of a 2.1-kilobase (kb) mRNA species. The deficiency of 
the 2.1-kb RNA species provides an explanation for the diminished synthesis of ASA
seen in pseudodeficiency fibroblasts. 
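
The signal change described above (AATAAC to AGTAAC) can be checked mechanically. The following minimal sketch compares two equal-length DNA strings and labels each difference as a transition (purine to purine or pyrimidine to pyrimidine) or a transversion; the function name is hypothetical.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitutions(reference, variant):
    """List (1-based position, reference base, variant base, type) for each mismatch."""
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    changes = []
    for pos, (ref, var) in enumerate(zip(reference.upper(), variant.upper()), start=1):
        if ref != var:
            # A transition keeps the base class; a transversion switches it.
            same_class = {ref, var} <= PURINES or {ref, var} <= PYRIMIDINES
            changes.append((pos, ref, var, "transition" if same_class else "transversion"))
    return changes


# Polyadenylation signal of the normal and the pseudodeficiency allele, as given in the text.
print(classify_substitutions("AATAAC", "AGTAAC"))
```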

MLD patients have been identified who are homozygous for the ASA-PD allele and it is 
thought that the allele might play a role in the development and progression of disease. 

There is a tight link between cell cycle control and polyadenylation machinery suggesting 
an important role of this machinery in the development of cancer. Cyclin-dependent 
enzymes seem to regulate the activity of the polyadenylation machinery. The amounts of 
some factors of the mRNA 3’ processing machinery (CstF) increase in mitotically active 
cells in phases of the cell cycle preceding DNA synthesis. The amount of the 64-kDa 
subunit CstF-64 increases 5-fold during the G0 to S phase transition and concomitant
proliferation induced by serum in 3T6 fibroblasts. The increase in CstF-64 is associated
with the G0 to S phase transition. Cdc2-cyclin B phosphorylates PAP at the Ser-Thr-rich
region.

However, it currently appears that most diseases associated with defects in mRNA
processing are caused by mutations in cis-acting elements that disrupt sequences
essential for pre-mRNA splicing. These can be canonical sequences at the intron-exon
border or located within an exon. They directly affect the expression of a single mutated
gene. Approximately 15% of the nucleotide substitutions that cause human diseases disrupt
pre-mRNA splicing. Thus these diseases do not seem to be directly caused by
alterations in the polyadenylation/cleavage machinery.

However, since evidence for a number of interrelationships between
polyadenylation/cleavage and splicing has recently been accumulating (for review see
Zhao, Hyman and Moore in Microbiology and Molecular Biology Reviews, 1999,
pp. 405-445), it might very well be that alterations in the 3'-end processing machinery
contribute to the etiology of these diseases.

Examples of diseases caused by incorrect splicing are mentioned below. 

Amyotrophic lateral sclerosis (ALS) is a neurodegenerative disease involving 
degeneration of cortical motor neurons and spinal/bulbar motor neurons. In the sporadic 
form of the disease, the neuron degeneration is caused by excessive extracellular 
glutamate. The glutamate transporter functional in the CNS is the astrocyte EAAT2 which 
is altered in ALS. The pre-mRNA for EAAT2 is aberrantly spliced in the brain regions 
affected. The reason for this is still unknown, but the defect probably lies in one or a few
auxiliary splicing factors that regulate the splicing of a subset of pre-mRNAs in these
cells. The factors have not yet been identified. 

The human papillomavirus (HPV) E2 protein plays an important role in transcriptional
regulation of viral genes as well as in viral DNA replication. The E2 protein of HPV-5 (an
epidermodysplasia verruciformis (EV)-associated HPV) can specifically interact with cellular
splicing factors including a set of prototypical SR proteins and two snRNP-associated
proteins (Lai, Teh et al. 1999, J. Biol. Chem. 274, pages 11832-41). Interestingly, all
three of these viruses have been associated with cancer progression. Papillomavirus
infection precedes cervical cancer, whereas EBV and HSV-8 have been described in
association with lymphomas.

In hepatocellular carcinoma, there is a defect in mRNA splicing. In this disease, there are
anti-nuclear antibodies to a 64 kDa protein, which has splicing factor motifs. A defect in
the regulatory subunit 3 of the protein phosphatase 1 (PP1) has been found in
haematological malignancies and in lung, ovarian, colorectal and gastric cancers. Low
PP1 activity has also been observed in acute myelogenous leukaemia.

Heterogeneous nuclear ribonucleoproteins (hnRNP) associate with pre-mRNA and have 
a role in RNA processing and splice site selection. HnRNP A2 shows a marked 
overexpression in lung cancer and brain tumours and has thus been used as a 
biomarker for these tumor types. 

The development of antinuclear antibodies (ANA) in malignancies has been described 
but its mechanism is still not understood. A great diversity of ANA specificities is found in 
hepatocellular carcinoma. In hepatoma sera antibodies co-localize with non-snRNP 
splicing factor SC35, suggesting that the antigenic targets might be involved in mRNA 
splicing. Hepatocellular carcinoma has a significantly higher frequency of ANA than 
chronic hepatitis C, chronic hepatitis B, alcoholic liver cirrhosis or healthy donors. 

In some autoimmune diseases, a possible link has been detected to a preceding virus 
infection, like Epstein-Barr virus in SLE. Furthermore, it seems that even vaccination is
potentially dangerous: a candidate cytomegalovirus (CMV) vaccine is glycoprotein gB
(UL55). Immunization with an adenovirus-gB construct (Ad-gB) induces not only a
significant anti-viral response, but also a significant IgG auto-antibody response (p > 0.005)
to the U1-70 kDa spliceosome protein. Auto-antibodies to U1-70 kDa are part of the anti- 
ribonucleoprotein response seen in systemic lupus erythematosus and mixed connective 
tissue disease. 

At least two molecules which are also part of the complex are known to be inhibited by
natural toxins or by treatments against various diseases.



Protein phosphatase 1 is inhibited by several natural product toxins.
The marine toxins include the cyanobacteria-derived cyclic heptapeptide microcystin-LR
and the polyether fatty acid okadaic acid from dinoflagellate sources. They bind to a
common site on PP1. The dephosphorylation activity of PP1 (among other
serine/threonine phosphatases: PP2A, PP2B, PP2C and PP5/T/K/H) is inhibited by
fumonisin B1 (FB1), a mycotoxin produced by the fungus Fusarium moniliforme. FB1 is a
common contaminant of corn and is suspected to be a cause of human esophageal cancer.
FB1 is hepatotoxic and hepatocarcinogenic in rats, although the mechanisms involved have
not been clarified.

Viral proteins are able to interfere with PP1 activity: 

The transcription factor EBNA2 of the Epstein-Barr virus induces the expression of the LMP1
oncogene in human B cells. EBNA2A from an EBV-immortalized B-cell line
co-immunopurifies with a PP1-like protein. A PP1-like activity in nuclear extracts from an
EBV-immortalized B-cell line can be inhibited by a GST-EBNA2A fusion product.

Poly(A) polymerase (PAP) is affected by anticancer drugs and is inhibited by some
antiviral agents. 

Anticancer drugs: 

Most anticancer drugs act through the mechanism of apoptosis. Apoptosis may be 
regulated at all levels of gene expression including the addition of the poly(A) tail to the 3'
end of mRNAs. Drug combinations are more effective than single drugs and various 
chemotherapeutic strategies have therefore been developed. Dimethylsulfoxide (DMSO) 
in combination with interferon (IFN) results in pronounced PAP dephosphorylation, 
activity reduction and apoptosis of HeLa cells. 

Purine and pyrimidine analogues often affect PAP activity. They are potentially useful 
agents for chemotherapy of cancer diseases. The anticancer drugs 5-Fluorouracil (5- 
FU), interferon and tamoxifen mediate both partial dephosphorylation and inactivation of 
poly(A) polymerase (PAP). 

PAP (from isolated hepatic nuclei) is inhibited by cordycepin 5'-triphosphate. The
nucleoside analogue cordycepin is a therapeutic agent for TdT+ (terminal
deoxynucleotidyl transferase positive) leukemia. In the presence of an adenosine
deaminase inhibitor, deoxycoformycin (dCF), cordycepin is cytotoxic to leukemic TdT+
cells. A cordycepin analog of (2'-5') oligo(A), which can be synthesized enzymatically
from cordycepin 5'-triphosphate, and the core cordycepin analog can replace human
fibroblast interferon in preventing the transformation of human lymphocytes after
infection with Epstein-Barr virus B95-8 (EBV). The core cordycepin analog is not
cytotoxic to uninfected lymphocytes and proliferating lymphoblasts.

Not only is PAP affected by anticancer drugs, but it has a possible use as a tumor 
marker involved in cell commitment and/or induction of apoptosis and could be used to 
evaluate tumor cell sensitivity to anticancer treatment. 

Antiviral drugs: 

Ara-ATP (arabinofuranosyladenosine triphosphate) is an antiherpetic drug that inhibits 
herpes simplex virus replication. It inhibits poly(A) polymerase activity by competing with 
ATP. It blocks both cleavage and polyadenylation reactions by interacting with the ATP- 
binding site on poly(A) polymerase, the activity of which is essential for the cleavage 
reaction. 

Purine and pyrimidine analogues are also used as antiviral agents. As an example, the 
most extensively used drug against HSV is idoxuridine, the 5'-amino analog of
thymidine. 

A decrease in herpes simplex virus transcription and perturbation of RNA 
polyadenylation is induced by 5'-amino-5'-deoxythymidine (AdThd).

The cleavage stimulation factor (CstF):

Treatment with hydroxyurea or ultraviolet light strongly, but transiently, inhibits 3'
cleavage. This is accompanied by increased amounts of a CstF/BARD1/BRCA1
complex, though the amount of these proteins remains the same.

Despite the large body of information already available from the prior art concerning the
cleavage/polyadenylation machinery of precursor mRNA, not all components of
the machinery are known to date, let alone the composition of the complex as a whole.



3. SUMMARY OF THE INVENTION 



An object of the present invention was to identify the components of the
cleavage/polyadenylation machinery of precursor mRNA, to provide new components of
this machinery, to provide the machinery itself, and to provide new
targets for therapy.

By applying the process according to the invention to the isolation of the
polyadenylation/cleavage machinery from yeast, 32 new components could be identified,
which are Act1 (SEQ ID:1), Cka1 (SEQ ID:7), Eft2 (SEQ ID:11), Eno2 (SEQ ID:13), Glc7
(SEQ ID:15), Gpm1 (SEQ ID:17), Hhf2 (SEQ ID:21), Hta1 (SEQ ID:23), Hsc82 (SEQ
ID:25), Imd2 (SEQ ID:27), Imd4 (SEQ ID:29), Met6 (SEQ ID:31), Pdc1 (SEQ ID:39), Pfk1
(SEQ ID:41), Ref2 (SEQ ID:47), Sec13 (SEQ ID:53), Sec31 (SEQ ID:55), Ssa3 (SEQ
ID:57), Ssu72 (SEQ ID:59), Taf60 (SEQ ID:61), Tkl1 (SEQ ID:65), Tsa1 (SEQ ID:67),
Tye7 (SEQ ID:69), Vid24 (SEQ ID:71), Vps3 (SEQ ID:73), Ycl046w (SEQ ID:79),
Ygr156w (SEQ ID:81), Yhl035c (SEQ ID:83), Ykl018w (SEQ ID:85), Ylr221c (SEQ
ID:87), Yml030w (SEQ ID:91) and Yor179c (SEQ ID:93).

Said object is further achieved by the characterisation of Ycl046w (SEQ ID:79), Ygr156w
(SEQ ID:81), Yhl035c (SEQ ID:83), Ykl018w (SEQ ID:85), Ylr221c (SEQ ID:87),
Yml030w (SEQ ID:91) and Yor179c (SEQ ID:93) as components of the
cleavage/polyadenylation machinery. 

The invention thus relates to: 

An isolated complex selected from
complex (I) comprising
(a) a first protein, or a functionally active fragment or functionally active derivative
thereof, which first protein is selected from the group of proteins in Table 1, column A, or
a mammalian homolog thereof, or a variant of said protein encoded by a nucleic acid that
hybridizes to the nucleic acid of said protein or its complement under low stringency
conditions; and

(b) a second protein, or a functionally active fragment or functionally active
derivative thereof, which second protein is selected from the group of proteins in Table 1,
column B, or a mammalian homolog thereof, or a variant of said protein encoded by a
nucleic acid that hybridizes to the nucleic acid of said protein or its complement under
low stringency conditions, wherein said first protein and said second protein are
members of a native cellular polyadenylation complex, and wherein said low stringency
conditions comprise hybridization in a buffer comprising 35% formamide, 5X SSC, 50
mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml
denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at 40°C,
washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and
0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C; and a complex (II)
comprising at least two second proteins.

Furthermore, the invention relates to an isolated complex
comprising all proteins in column C of table 1, or the mammalian homologs of those
proteins, or variants of said proteins encoded by nucleic acid that hybridises to the
nucleic acid of any of said proteins or its complements under low stringency conditions,
wherein said proteins are members of a native cellular complex, and wherein said low
stringency conditions comprise hybridization in a buffer comprising 35% formamide, 5X
SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100
µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at
40°C, washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA,
and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X SSC, 25
mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.
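
As a worked illustration of the hybridization buffer recited above, the following sketch computes the volume of each stock solution needed for a given batch using the dilution relation C1 x V1 = C2 x V2. The final concentrations are those of the text; the stock concentrations (for example 20X SSC or 1 M Tris-HCl) are assumptions chosen for illustration and are not taken from this document.

```python
# Each entry: (final concentration, ASSUMED stock concentration), both in the same unit.
# Final values follow the text; stock values are hypothetical laboratory stocks.
HYBRIDIZATION_BUFFER = {
    "formamide (%)": (35.0, 100.0),
    "SSC (X)": (5.0, 20.0),
    "Tris-HCl pH 7.5 (mM)": (50.0, 1000.0),
    "EDTA (mM)": (5.0, 500.0),
    "PVP (%)": (0.02, 2.0),
    "Ficoll (%)": (0.02, 2.0),
    "BSA (%)": (0.2, 10.0),
    "salmon sperm DNA (ug/ml)": (100.0, 10000.0),
    "dextran sulfate (% wt/vol)": (10.0, 50.0),
}


def dilution_volumes(recipe, total_ml):
    """Return the volume (ml) of each stock, plus water, for a batch of total_ml."""
    volumes = {}
    for name, (final_conc, stock_conc) in recipe.items():
        if final_conc > stock_conc:
            raise ValueError(name + ": stock is more dilute than the target")
        # C1 * V1 = C2 * V2  ->  V1 = V2 * C2 / C1
        volumes[name] = total_ml * final_conc / stock_conc
    water = total_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute to reach all targets in this volume")
    volumes["water"] = water
    return volumes


for component, ml in dilution_volumes(HYBRIDIZATION_BUFFER, 10.0).items():
    print(f"{component}: {ml:.2f} ml")
```

With these assumed stocks the components fit into the batch volume with water to spare; more dilute stocks would trigger the error branch instead.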

Furthermore, the invention relates to an isolated complex that comprises all but
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26,
27 or 28 of all proteins in column C of table 1.

Furthermore, the invention relates to the complex as described above comprising 
a functionally active derivative of said first protein and/or a functionally active derivative




WO 02/092626 



PCT/EP02/05359 




comprising said first protein or said second protein fused to an amino acid sequence 
different from the first protein or second protein, respectively. 

In a preferred embodiment of the present invention, the protein components of the 
complex are vertebrate homologs of the yeast proteins, or a mixture of yeast and 
vertebrate homolog proteins. In a more preferred embodiment, the protein components 
of the complex are mammalian homologs of the yeast proteins, or a mixture of yeast and 
mammalian homolog proteins. In particular aspects, the native component proteins, or
derivatives or fragments of the complex are obtained from a mammal such as mouse, 
rat, pig, cow, dog, monkey, human, sheep or horse. In another preferred embodiment, 
the protein components of the complex are human homologs of the yeast proteins, or a 
mixture of yeast and human homolog proteins. In yet another preferred embodiment, the 
protein components of the complex are a mixture of yeast, vertebrate, mammalian and/or 
human proteins. 

Furthermore, the invention relates to a complex as described above that is
involved in the 3' end processing activity. Such a complex might also exist as a module 
or subcomplex of a larger physiological protein complex or assembly. 

Furthermore, the invention relates to a complex as described above comprising a 
fragment of said first protein and/or a fragment of said second protein, which fragment 
binds to another protein component of said complex. 

Furthermore, the invention relates to a complex as described above, wherein the 
functionally active derivative is a fusion protein comprising said first protein or said 
second protein preferentially fused to an affinity tag or label. 

It is further directed to complexes comprising a fusion protein which comprises a 
component of the complex or a fragment thereof linked via a covalent bond to an amino 
acid sequence different from said component protein, as well as nucleic acids encoding 
the protein, fusions and fragments thereof. For example, the non-component protein 
portion of the fusion protein, which can be added to the N-terminal, the C-terminal or 
inserted into the amino acid sequence of the complex component can comprise a few 
amino acids, which provide an epitope that is used as a target for affinity purification of 
the fusion protein and/or complex. 

Furthermore, the invention relates to a process for processing RNA comprising the
step of bringing into contact any of the complexes described above with RNA, such that
RNA is processed.

Furthermore, the invention relates to an antibody or a fragment of said antibody
containing the binding domain thereof, which binds the complex as described above
and which does not bind the first protein when uncomplexed or the second protein
when uncomplexed.

Furthermore, the invention relates to a pharmaceutical composition comprising 
the protein complex as described above and a pharmaceutically acceptable carrier. 

Moreover, the present invention provides a process for the identification and/or 
preparation of an effector of a composition according to the invention which process 
comprises the steps of bringing into contact the composition of the invention or of a 
component thereof with a compound, a mixture of compounds or a library of compounds 
and determining whether the compounds or certain compounds of the mixture or library 
bind to the composition of the invention and/or a component thereof and/or affect the
biological activity of such a composition or component and then optionally further 
purifying the compound positively tested as effector by such a process. 

A major application of the composition according to the invention results in the 
identification of an active agent capable of binding thereto. Hence, the compositions of 
the invention are useful tools in screening for new pharmaceutical drugs. 

Furthermore, the invention relates to a method for screening for a molecule that 
modulates directly or indirectly the function, activity, composition or formation of the 
complex as described above comprising the steps of:

(a) exposing said complex, or a cell or organism containing said complex to one or more 
candidate molecules; and 

(b) determining the amount of, the 3' end processing activity for mRNA of, or protein 
components of, said complex, wherein a change in said amount, activity, or protein 
components relative to said amount, activity or protein components in the absence of 
said candidate molecules indicates that the molecules modulate function, activity or 
composition of said complex. 
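
The comparison in step (b) can be sketched as a simple decision rule: a measurement taken in the presence of a candidate molecule is compared with the measurement taken in its absence. The text only requires "a change"; the two-fold threshold and the function name below are assumptions made for illustration.

```python
def modulates(value_with, value_without, min_fold_change=2.0):
    """Flag a candidate if the measured amount or activity changes by at least
    min_fold_change in either direction relative to the untreated control.

    The fold-change cutoff is a hypothetical choice, not part of the method as described.
    """
    if value_without <= 0 or value_with < 0:
        raise ValueError("control must be positive and treated value non-negative")
    if min_fold_change <= 1:
        raise ValueError("min_fold_change must be greater than 1")
    ratio = value_with / value_without
    # Either a sufficient increase or a sufficient decrease counts as modulation.
    return ratio >= min_fold_change or ratio <= 1.0 / min_fold_change


# Hypothetical readouts of 3' end processing activity (arbitrary units).
print(modulates(10.0, 100.0))   # strong inhibition
print(modulates(95.0, 100.0))   # essentially unchanged
```

In practice the cutoff would be replaced by whatever statistical criterion suits the assay and its replicate structure.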

Furthermore, the invention relates to a method as described above, wherein the 
amount of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein the 
activity of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises isolating from the cell or organism said complex to produce
said isolated complex and contacting said isolated complex with the substrate under
conditions conducive to binding to the complex.

Furthermore, the invention relates to a method as described above, wherein the 
protein components of said complex are determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises determining whether any of the proteins listed in column B of
table 1 of said complex or the mammalian homologs thereof, or variant of said proteins
encoded by a nucleic acid that hybridises to the nucleic acids of any of said proteins or
its complements under low stringency conditions, is present in the complex, wherein said
low stringency conditions comprise hybridization in a buffer comprising 35% formamide,
5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA,
100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20
hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM
EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X
SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.

Furthermore, the invention relates to a method as described above, wherein said 
method is a method of screening for a drug for treatment or prevention of diseases and 
disorders, preferably diseases or disorders such as infectious diseases; viral infections
such as herpes simplex infections, Epstein-Barr infections, influenza; metabolic disease
such as metachromatic leukodystrophy; neurodegenerative disorders such as
amyotrophic lateral sclerosis; and cancer.

Furthermore, the invention relates to a method for screening for a molecule that 
binds the complex as described above comprising the following steps: 

(a) exposing said complex, or a cell or organism containing said complex, to one or more 
candidate molecules; and 

(b) determining whether said complex is bound by any of said candidate molecules. 

Furthermore, the invention relates to a method for diagnosing or screening for the
presence of a disease or disorder or a predisposition for developing a disease or
disorder in a subject, which disease or disorder is characterized by an aberrant amount
of, the 3' end processing activity for mRNA of, or component
composition or formation of, the complex as described above, comprising determining
the amount of, the 3' end processing activity for mRNA of, or protein components of, said
complex in a sample derived from a subject, wherein a difference in said amount,
activity, or protein components of said complex relative to an analogous sample from a subject
not having the disease or disorder or predisposition indicates the presence in the subject
of the disease or disorder or predisposition.

Furthermore, the invention relates to a method as described above, wherein the 
amount of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein the 
activity of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises isolating from the cell or organism said complex to produce 
said isolated complex and contacting said isolated complex with the substrate under
conditions conducive to binding to the complex. 

Furthermore, the invention relates to a method as described above, wherein the 
protein components of said complex are determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises determining whether any of the proteins listed in column B of 
table 1 of said complex or the mammalian homologs thereof, or variant of said proteins 
encoded by a nucleic acid that hybridises to the nucleic acids of any of said proteins or 
its complements under low stringency conditions, is present in the complex, wherein said 
low stringency conditions comprise hybridization in a buffer comprising 35% formamide, 
5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA,
100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20
hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM
EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X
SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.

Furthermore, the invention relates to a method for treating or preventing a disease 
or disorder characterized by an aberrant amount of, the 3’ end processing activity for 
mRNA of, or component composition or formation of, the complex as described above, 
comprising administering to a subject in need of such treatment or prevention a 
therapeutically effective amount of one or more molecules that modulate the amount of, 
the 3' end processing activity for mRNA of, or protein components or formation of, said 
complex. 

Furthermore, the invention relates to a method as described above, wherein said
disease or disorder involves decreased levels of the amount or activity of said complex.

Furthermore, the invention relates to a method as described above, wherein said disease
or disorder involves increased levels of the amount or activity of said complex.

Furthermore, the invention relates to the use of a molecule that modulates the
amount of, the 3' end processing activity for mRNA of, or protein components or
formation of the complex as described above for the manufacture of a medicament for
the treatment or prevention of a disease or disorder, preferably diseases or disorders
such as infectious diseases; viral infections such as herpes simplex infections,
Epstein-Barr infections, influenza; metabolic disease such as metachromatic leukodystrophy;
neurodegenerative disorders such as amyotrophic lateral sclerosis; and cancer.

Furthermore, the invention relates to a kit comprising in one or more containers 

(a) an isolated first protein, or a functionally active fragment or functionally active 
derivative thereof selected from the proteins in column A of table 1 of a given complex or 
a mammalian homolog thereof, or a variant of said protein encoded by a nucleic acid that 
hybridises to the nucleic acid of said protein or its complement under low stringency 
conditions; and 

(b) an isolated second protein, or a functionally active fragment or functionally active 
derivative thereof selected from the proteins in column B of table 1 of a given complex or 
a mammalian homolog thereof, or a variant of said protein encoded by a nucleic acid that
hybridises to the nucleic acid of said protein or its complement under low stringency
conditions, wherein said first and said second protein are members of a native cellular
complex, and wherein said low stringency conditions comprise hybridization in a buffer
comprising 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02%
PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml denatured salmon sperm DNA, and 10%
(wt/vol) dextran sulfate for 18-20 hours at 40°C, washing in a buffer consisting of 2X
SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55°C, and
washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and
0.1% SDS for 1.5 hours at 60°C.

Furthermore, the invention relates to a kit comprising in a container the isolated 
complex as described above or the antibody as described above, optionally together with 
further reagents and working instructions. The further reagents may be, for example, 
buffers, substrates for enzymes but also carrier material such as beads, filters, 
microarrays and other solid carriers. The working instructions may indicate how to use
the ingredients of the kit in order to perform a desired assay.

Furthermore, the invention relates to such kits for use in processing of RNA and 
for use in the diagnosis, prognosis and screening in or for the diseases mentioned 
above.

Furthermore, the invention relates to a complex as described above, or the
antibody or fragment as described above, for use in a method of diagnosing a disease or 
disorder, preferably the diseases or disorders as mentioned above. 

Furthermore, the invention relates to a method for the production of a 
pharmaceutical composition comprising carrying out the method as described above to 
identify a molecule that modulates the function, activity or formation of said complex, and 
further comprising mixing the identified molecule with a pharmaceutically acceptable 
carrier. 

Furthermore, the invention relates to a process for preparing a complex as
described above and optionally the components thereof comprising the following steps: 
expressing such a protein in a target cell, isolating the protein complex which is attached 
to the tagged protein, and optionally disassociating the protein complex and isolating the 
individual complex members. 

Furthermore, the invention relates to the process as described above 
characterized in that the tagged protein comprises two different tags which allow two 
separate affinity purification steps. 

Furthermore, the invention relates to the process as described above, 
characterized in that two tags are separated by a cleavage site for a protease. 

Furthermore, the invention relates to a component of the said complex obtainable 
by a process as described above. 

The present invention further relates to a composition, preferably a protein 
complex, which is obtainable by the method comprising the following steps: tagging a 
protein as defined above, i.e. a protein which forms part of a protein complex, with a 
moiety, preferably an amino acid sequence, that allows affinity purification of the tagged 
protein and expressing such protein in a target cell and isolating the protein complex 
which is attached to the tagged protein. The details of such purification are described in 
WO 00/09716 and in Rigaut, G. et al. (1999), Nature Biotechnology, Vol. 17 (10): 1030- 
1032 and further herein below. The tagging can essentially be performed with any 
moiety which is capable of providing a specific interaction with a further moiety, e.g. in 
the sense of a ligand receptor interaction, antigen antibody interaction or the like. The 
tagged protein can also be expressed in an amount in the target cell which comes close 
to the physiological concentration in order to avoid a complex formation merely due to 
high concentration of the expressed protein but not reflecting the naturally occurring
complex.

In a further preferred embodiment, the composition is obtained by using a
tagged protein which comprises two different tags which allow two different affinity 
purification steps. This measure allows a higher degree of purification of the composition 
in question. In a further preferred embodiment the tagged protein comprises two tags 
that are separated by a cleavage site for a protease. This allows a step-by-step 
purification on affinity columns. 

Furthermore, the invention relates to a complex as described above and/or 
protein thereof as a target for an active agent of a pharmaceutical, preferably a drug 
target in the treatment or prevention of disease or disorder, preferably diseases or 
disorders as mentioned above.

Furthermore, the invention relates to the proteins Ycl046w (SEQ ID: 59), Ygr156w
(SEQ ID: 61), Yhl035c (SEQ ID: 63), Ykl018w (SEQ ID: 179), Ylr221c (SEQ ID: 67),
Yml030w (SEQ ID: 69), and Yor179c (SEQ ID: 71), the mammalian homologs/orthologs of
said proteins and functionally active fragments and derivatives of said proteins and the 
mammalian homologs thereof carrying one or more amino acid substitutions, deletions 
and/or additions and the nucleic acid encoding said proteins or said homologs, orthologs 
and functionally active fragments and derivatives thereof. 

Such a nucleic acid may be used for example to express a desired tagged protein 
in a given cell for the isolation of a complex or component according to the invention. 
Such a nucleic acid may also be used for the identification and isolation of genes from 
other organisms by cross species hybridization. 

The present invention further relates to a construct, preferably a vector 
construct, which comprises a nucleic acid as described above. Such constructs may 
comprise expression controlling elements such as promoters, enhancers and terminators 
in order to express the nucleic acids in a given host cell, preferably under conditions 
which resemble the physiological concentrations. 

The present invention further relates to a host cell containing a construct as 
defined above. 

Such a host cell can be, e.g., any eukaryotic cell such as a yeast, plant or
mammalian cell, with human cells being preferred. Such host cells may form the starting
material for isolation of a complex according to the present invention. 

Animal models and methods of screening for modulators (i.e., agonists and
antagonists) of the amount of, activity of, or protein component composition of, a
complex of the present invention are also provided.

3.1 DEFINITIONS

The term "mammalian homolog" or “homologous gene products” as used herein 
means a component protein of the cleavage/polyadenylation machinery of a mammal 
which performs the same function as the corresponding yeast protein. Such homologs 
are also termed "orthologue gene products". The algorithm for the detection of
orthologue gene pairs from yeast and mammals, including human, uses the whole genomes of
these organisms. First, pairwise best hits are retrieved, using a full Smith-Waterman
alignment of predicted proteins. To further improve reliability, these pairs are clustered
with pairwise best hits involving Drosophila melanogaster and C. elegans proteins. Such
analysis is given, e.g., in Nature, 2001, 409:860-921. The mammalian homologs of the
yeast proteins according to the invention can either be isolated based on the sequence 
homology of the yeast genes to the mammalian genes by cloning the respective gene 
applying conventional technology and expressing the protein from such gene, or by 
isolating the mammalian proteins by isolating the analogous complex according to 
methods commonly known in the art, and as described in Section 6, infra. 
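
The first step of the orthologue detection described above, the retrieval of pairwise best hits, can be sketched in Python as follows. This is a minimal illustration only: it assumes that "pairwise best hits" means reciprocal best hits, that alignment scores (for example Smith-Waterman scores) have already been computed, and that higher scores are better. The function names, the placeholder protein identifiers and the scores are hypothetical and are not taken from the analysis cited above; the subsequent clustering with Drosophila melanogaster and C. elegans hits is not shown.

```python
def best_hits(scores):
    """Return, for each query protein, its highest-scoring subject protein.

    `scores` maps (query, subject) pairs to precomputed alignment scores.
    On ties, the first pair encountered is kept.
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Pairs (a, b) where b is the best hit of a AND a is the best hit of b."""
    forward = best_hits(a_vs_b)
    reverse = best_hits(b_vs_a)
    return sorted((a, b) for a, b in forward.items() if reverse.get(b) == a)


# Hypothetical scores for placeholder proteins (not real data).
yeast_vs_human = {
    ("yeast_p1", "human_p1"): 310,
    ("yeast_p1", "human_p3"): 50,
    ("yeast_p2", "human_p2"): 120,
}
human_vs_yeast = {
    ("human_p1", "yeast_p1"): 305,
    ("human_p2", "yeast_p2"): 118,
    ("human_p3", "yeast_p2"): 40,
}
pairs = reciprocal_best_hits(yeast_vs_human, human_vs_yeast)
```

Requiring the best hit in both directions excludes pairs such as the placeholder `human_p3`, whose best yeast hit does not select it in return.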

The term "protein complex machinery" as used herein means a complex of
proteins in the cell that is able to perform one or more functions of the wild type protein
complex. The protein complex may or may not include and/or be associated with other
molecules such as nucleic acid, such as RNA or DNA, or lipids.

As used herein, the term "percent identity” means the number of identical 
residues as defined by an optimal alignment using the Smith-Waterman algorithm 
divided by the length of the overlap multiplied by 100. The alignment is performed by the 
search program (W.R. Pearson, 1991, Genomics 11:635-650) with the constraint to align 
the maximum of both sequences. 
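
The arithmetic of this definition can be illustrated by the following Python sketch. It assumes that the two sequences have already been aligned (for example by a Smith-Waterman implementation, which is not shown), that gaps are written as `-`, and that the "length of the overlap" is taken as the number of aligned columns. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Identical residues divided by the overlap length, multiplied by 100.

    Both arguments are rows of one pairwise alignment and must therefore
    have equal length. Gap-to-gap columns are never counted as identical.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    overlap = len(aligned_a)  # assumed meaning of "length of the overlap"
    if overlap == 0:
        raise ValueError("empty alignment")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return identical / overlap * 100.0


# Hypothetical aligned fragments: 4 identical residues over 6 columns.
example = percent_identity("MKT-LV", "MKTALI")
```

For the hypothetical fragments above, four of six aligned columns are identical, giving a value of about 66.7.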

As used herein, the terms “derivatives” or “analogs of component proteins” or
“variants” include, but are not limited to, molecules comprising regions that are
substantially homologous to the component proteins, in various embodiments, by at least
30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% identity over an amino acid sequence of
identical size or when compared to an aligned sequence in which the alignment is done
by a computer homology program known in the art, or whose encoding nucleic acid is
capable of hybridizing to a sequence encoding the component protein under stringent,
moderately stringent, or nonstringent conditions. It means a protein which is the outcome
of a modification of the naturally occurring protein, by amino acid substitutions, deletions
and additions, respectively, which derivatives still exhibit the biological function of the
naturally occurring protein although not necessarily to the same degree. The biological 
function of such proteins can e.g. be examined by available in vitro 
cleavage/polyadenylation assays as will be described below. 

As used herein, the term "Therapeutics" includes, but is not limited to, a protein
complex of the present invention, the individual component proteins, and analogs and 
derivatives (including fragments) of the foregoing (e.g., as described hereinabove); 
antibodies thereto (as described hereinabove); nucleic acids encoding the component 
protein, and analogs or derivatives, thereof (e.g., as described hereinabove); component 
protein antisense nucleic acids, and agents that modulate complex formation and/or 
activity (i.e., agonists and antagonists). 

"Target for therapeutic drug" means that the respective protein (target) can bind 
the active ingredient of a pharmaceutical composition and thereby change its biological
activity in response to the drug binding.

"Effector of the cleavage/polyadenylation of precursor mRNA" means a 
compound that is capable of binding to a member of the cleavage/polyadenylation 
machinery thereby altering the cleavage/polyadenylation activity of the complex. This 
altering can be a reduction or increase in cleavage/polyadenylation activity. 

The terms “polyadenylation complex", “cleavage/polyadenylation machinery” and 
“cleavage/polyadenylation complex” are used interchangeably herein. 

4. BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows elements of mammalian and yeast mRNA, respectively, which are involved
in polyadenylation/cleavage of precursor mRNA.

Fig. 2 shows a schematic representation of the gene targeting procedure. The TAP 
cassette is inserted at the C-terminus of a given yeast ORF by homologous 
recombination, generating the TAP-tagged fusion protein. 

Fig. 3 shows the protein pattern obtained by separation of the members of the
polyadenylation-complex of yeast by TAP, using Pta1 as a bait. Protein bands for Cft1,
Cft2, Ysh1, Rna14, Pab1, Pcf11, Ref2, Pap1, Clp1, YKL059C, Pfs2, YGR156w, Fip1,
Rna15, YKL018w, Glc7, Yth1, Ssu72, YOR179C and Pta1 (in bold) are labeled. (Further
proteins identified as components of the yeast complex as described in the EXAMPLES
section (infra) are not stated in the figure.)

Fig. 4 shows the protein pattern obtained by the separation of the members of the
polyadenylation-complex in some of the reverse tagging-experiments and re-purification
of a selection of the novel interactors. The baits used for TAP in the different
experiments are given on top of each gel picture. The band constituting the protein used
as the bait in the respective experiments is indicated by an arrow. Previously known
members of the complex are listed in bold letters. (Note: only experiments using Cft1,
Cft2, Pap1, Ref2, YKL059c, Pfs2, YOR179c and Pta1 as a bait are shown and only the
protein bands of Cft1, Cft2, Ysh1, Rna14, Pab1, Ref2, Clp1, Ygr156w, Fip1, Glc7,
Yth1, Yor179c, Pta1, Pcf11, Pab1, Ykl059c, Pfs2, Rna15, Ykl018w and Ssu72 are
labelled.)

5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to components of the cleavage/polyadenylation machinery 
of precursor mRNA, the complete protein complex, uses of said components and
complex as well as to methods of preparing same. 

This is further described below. Also a description of the newly identified components of 
the cleavage/polyadenylation machinery is given below. 

In more detail, the present invention relates to the following embodiments: 

An isolated complex selected from complex (I) comprising

(a) a first protein, or a functionally active fragment or functionally active derivative 
thereof, which first protein is selected from the group of proteins in Table 1, column A, or 
a mammalian homolog thereof, or a variant of said protein encoded by a nucleic acid that 
hybridizes to the nucleic acid of said protein or its complement under low stringency 
conditions; and 

(b) a second protein, or a functionally active fragment or functionally active 
derivative thereof, which second protein is selected from the group of proteins in Table 1,
column B, or a mammalian homolog thereof, or a variant of said protein encoded by a
nucleic acid that hybridizes to the nucleic acid of said protein or its complement under
low stringency conditions, wherein said first protein and said second protein are
members of a native cellular Polyadenylation-complex, and wherein said low stringency
conditions comprise hybridization in a buffer comprising 35% formamide, 5X SSC, 50
mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml
denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at 40°C,
washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and
0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C, and a complex (II)
comprising at least two second proteins.

The present invention further relates to a new protein complex which is useful for 
cleaving and/or polyadenylating a nucleic acid and which complex comprises at least one 
of the components according to the invention. Such a complex can be isolated from a 
natural source by applying the process according to the invention or can be reconstituted 
from the different components made available by the present invention. 

Furthermore, the invention relates to an isolated complex comprising all
proteins in column C of table 1, or the mammalian homologs of those proteins, or
variants of said proteins encoded by nucleic acid that hybridises to the nucleic acid of
any of said proteins or its complements under low stringency conditions, wherein said
proteins are members of a native cellular complex, and wherein said low stringency
conditions comprise hybridization in a buffer comprising 35% formamide, 5X SSC, 50
mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml
denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at 40°C,
washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and
0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.

Furthermore, the invention relates to an isolated complex that comprises all but 
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 or 28
of all proteins in column C of table 1. 

Furthermore, the invention relates to the complex as described above comprising 
a functionally active derivative of said first protein and/or a functionally active derivative
of said second protein, wherein the functionally active derivative is a fusion protein
comprising said first protein or said second protein fused to an amino acid sequence
different from the first protein or second protein, respectively.

The present invention further relates to a fusion protein comprising a component 
according to the invention. The fusion part, which can be added to the N-terminus, the
C-terminus or into the amino acid sequence of the component according to the invention,
may comprise a few amino acids only, e.g. at least five, which amino acids for example
provide an epitope which is then used as a target for affinity purification of the protein
and the complex, respectively. Such a type of added amino acid is also termed "tag" 
throughout the present specification (optionally, the fusion protein may comprise even 
more than one such fusion partner). 

In a preferred embodiment of the present invention, the protein components of the 
complex are vertebrate homologs of the yeast proteins, or a mixture of yeast and 
vertebrate homolog proteins. In a more preferred embodiment, the protein components 
of the complex are mammalian homologs of the yeast proteins, or a mixture of yeast and 
mammalian homolog proteins. In particular aspects, the native component proteins, or
derivatives or fragments of the complex are obtained from a mammal such as mouse, 
rat, pig, cow, dog, monkey, human, sheep or horse. In another preferred embodiment, 
the protein components of the complex are human homologs of the yeast proteins, or a 
mixture of yeast and human homolog proteins. In yet another preferred embodiment, the 
protein components of the complex are a mixture of yeast, vertebrate, mammalian and/or 
human proteins. 

The mammalian homologs or "orthologues" of the yeast proteins according to the 
invention can either be isolated based on the sequence homology of the yeast genes to 
the mammalian genes by cloning the respective gene applying conventional technology 
and expressing the protein from such gene or by isolating the mammalian proteins 
according to the process of the invention as explained in more detail below. 

The derivatives of the proteins according to the invention can be produced e.g. by 
recombinant DNA technology applying the standard technology to modify the amino acid 
sequence of a given protein via the modification of the underlying gene using e.g. site 
directed mutagenesis, etc. 

A protein that shows a certain degree of identity to the naturally occurring proteins 
from mammals and/or yeast, respectively, can also be prepared e.g. by applying 
recombinant DNA technology as described above for the derivatives according to the
invention. Alternatively, such protein can be isolated from natural sources by applying the
process of the invention.

Furthermore, the invention relates to a complex as described above that is 
involved in the 3' end processing activity. Such a complex might also exist as a module 
or subcomplex of a larger physiological protein complex or assembly. 

Furthermore, the invention relates to a complex as described above comprising a 
fragment of said first protein and/or a fragment of said second protein, which fragment 
binds to another protein component of said complex. 

Furthermore, the invention relates to a complex as described above, wherein the 
functionally active derivative is a fusion protein comprising said first protein or said 
second protein preferentially fused to an affinity tag or label. 

It is further directed to complexes comprising a fusion protein which comprises a 
component of the complex or a fragment thereof linked via a covalent bond to an amino 
acid sequence different from said component protein, as well as nucleic acids encoding 
the protein, fusions and fragments thereof. For example, the non-component protein 
portion of the fusion protein, which can be added to the N-terminal, the C-terminal or 
inserted into the amino acid sequence of the complex component can comprise a few 
amino acids, which provide an epitope that is used as a target for affinity purification of 
the fusion protein and/or complex. 

Furthermore, the invention relates to a process for processing RNA comprising the
step of bringing into contact any of the complexes described above with RNA, such that 
RNA is processed. 

Furthermore, the invention relates to an antibody or a fragment of said antibody 
containing the binding domain thereof, which binds the complex as described above
and which does not bind the first protein when uncomplexed or the second protein
when uncomplexed. 

Furthermore, the invention relates to a pharmaceutical composition comprising 
the protein complex as described above and a pharmaceutically acceptable carrier. 

Moreover, the present invention provides a process for the identification and/or 
preparation of an effector of a composition according to the invention which process 
comprises the steps of bringing into contact the composition of the invention or of a 
component thereof with a compound, a mixture of compounds or a library of compounds 
and determining whether the compounds or certain compounds of the mixture or library 
bind to the composition of the invention and/or a component thereof and/or affect the
biological activity of such a composition or component and then optionally further
purifying the compound positively tested as effector by such a process.

A major application of the composition according to the invention results in the 
identification of an active agent capable of binding thereto. Hence, the compositions of 
the invention are useful tools in screening for new pharmaceutical drugs. 

Furthermore, the invention relates to a method for screening for a molecule that 
modulates directly or indirectly the function, activity, composition or formation of the 
complex as described above comprising the steps of:

(a) exposing said complex, or a cell or organism containing said complex to one or more 
candidate molecules; and 

(b) determining the amount of, the 3' end processing activity for mRNA of, or protein 
components of, said complex, wherein a change in said amount, activity, or protein 
components relative to said amount, activity or protein components in the absence of 
said candidate molecules indicates that the molecules modulate function, activity or 
composition of said complex. 

Furthermore, the invention relates to a method as described above, wherein the 
amount of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein the 
activity of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises isolating from the cell or organism said complex to produce 
said isolated complex and contacting said isolated complex with the substrate under
conditions conducive to binding to the complex. 

Furthermore, the invention relates to a method as described above, wherein the 
protein components of said complex are determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises determining whether any of the proteins listed in column B of 
table 1 of said complex or the mammalian homologs thereof, or variant of said proteins 
encoded by a nucleic acid that hybridises to the nucleic acids of any of said proteins or 
its complements under low stringency conditions, is present in the complex, wherein said 
low stringency conditions comprise hybridization in a buffer comprising 35% formamide, 
5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA,
100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20
hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM
EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X
SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.

Furthermore, the invention relates to a method as described above, wherein said 
method is a method of screening for a drug for treatment or prevention of diseases and 
disorders, preferably diseases or disorders such as infectious diseases; viral infections
such as herpes simplex infections, Epstein-Barr infections, influenza; metabolic disease
such as metachromatic leukodystrophy; neurodegenerative disorders such as
amyotrophic lateral sclerosis; and cancer.

Furthermore, the invention relates to a method for screening for a molecule that 
binds the complex as described above comprising the following steps: 

(a) exposing said complex, or a cell or organism containing said complex, to one or more 
candidate molecules; and 

(b) determining whether said complex is bound by any of said candidate molecules. 

Furthermore, the invention relates to a method for diagnosing or screening for the 
presence of a disease or disorder or a predisposition for developing a disease or 
disorder in a subject, which disease or disorder is characterized by an aberrant amount 
of, the 3' end processing activity for mRNA of, or component 
composition or formation of, the complex as described above, comprising determining 
the amount of, the 3' end processing activity for mRNA of, or protein components of, said 
complex in a sample derived from a subject, wherein a difference in said amount, 
activity, or protein components of said complex relative to an analogous sample from a subject 
not having the disease or disorder or predisposition indicates the presence in the subject 
of the disease or disorder or predisposition. 

Furthermore, the invention relates to a method as described above, wherein the 
amount of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein the 
activity of said complex is determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises isolating from the cell or organism said complex to produce 
said isolated complex and contacting said isolated complex with the substrate under 
conditions conducive to binding to the complex. 

Furthermore, the invention relates to a method as described above, wherein the 
protein components of said complex are determined. 

Furthermore, the invention relates to a method as described above, wherein said 
determining step comprises determining whether any of the proteins listed in column B of 
table 1 of said complex or the mammalian homologs thereof, or a variant of said proteins 
encoded by a nucleic acid that hybridises to the nucleic acids of any of said proteins or 
its complements under low stringency conditions, is present in the complex, wherein said 
low stringency conditions comprise hybridization in a buffer comprising 35% formamide, 
5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 
100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20 
hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM 
EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X 
SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C. 

Furthermore, the invention relates to a method for treating or preventing a disease 
or disorder characterized by an aberrant amount of, the 3' end processing activity for 
mRNA of, or component composition or formation of, the complex as described above, 
comprising administering to a subject in need of such treatment or prevention a 
therapeutically effective amount of one or more molecules that modulate the amount of, 
the 3' end processing activity for mRNA of, or protein components or formation of, said 
complex. 

Furthermore, the invention relates to a method as described above, wherein said 
disease or disorder involves decreased levels of the amount or activity of said complex. 
Furthermore, the invention relates to a method as described above, wherein said disease 
or disorder involves increased levels of the amount or activity of said complex. 

Furthermore, the invention relates to the use of a molecule that modulates the 
amount of, the 3' end processing activity for mRNA of, or protein components or 
formation of the complex as described above for the manufacture of a medicament for 
the treatment or prevention of a disease or disorder, preferably diseases or disorders 
such as infectious diseases; viral infections such as herpes simplex infections, 
Epstein-Barr infections, influenza; metabolic diseases such as metachromatic leukodystrophy; 
neurodegenerative disorders such as amyotrophic lateral sclerosis; cancer. 

The present invention further relates to the use of the products according to the 
invention in therapy wherein the products according to the invention are useful as a 
target for a therapeutic drug. It is known from the prior art that mRNA 3'-end processing 
is involved in viral growth, in the development of cancer and in certain neurodegenerative 
diseases. By having identified new components of the cleavage/polyadenylation 
machinery, the present invention hence offers new targets for treating viral diseases, 
cancer and neurodegenerative diseases. By affecting the biological activity of the 
components of the invention and/or by affecting the complex as a whole, the 
cleavage/polyadenylation activity thereof can be influenced depending on the needs of 
the patient to be treated. 

The present invention further relates to a pharmaceutical composition comprising 
a product according to the invention. Such a pharmaceutical composition contains, besides 
the product according to the invention as active ingredient, further excipients and 
additives as known by a skilled person. The present invention, hence, allows the 
identification of new effectors which affect the biological activity of the 
cleavage/polyadenylation machinery of precursor RNA. Said effectors can then be used 
to modify the cleavage/polyadenylation machinery in a cell by introducing an effector into 
a cell. Moreover, the mRNA processing activity of a given cell can also be affected by 
introducing a product according to the invention into such cell. 

Furthermore, the invention relates to a kit comprising in one or more containers 

(a) an isolated first protein, or a functionally active fragment or functionally active 
derivative thereof selected from the proteins in column A of table 1 of a given complex or 
a mammalian homolog thereof, or a variant of said protein encoded by a nucleic acid that 
hybridises to the nucleic acid of said protein or its complement under low stringency 
conditions; and 

(b) an isolated second protein, or a functionally active fragment or functionally active 
derivative thereof selected from the proteins in column B of table 1 of a given complex or 
a mammalian homolog thereof, or a variant of said protein encoded by a nucleic acid that 
hybridises to the nucleic acid of said protein or its complement under low stringency 
conditions, wherein said first and said second protein are members of a native cellular 
complex, and wherein said low stringency conditions comprise hybridization in a buffer 
comprising 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml denatured salmon sperm DNA, and 10% 
(wt/vol) dextran sulfate for 18-20 hours at 40°C, washing in a buffer consisting of 2X 
SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55°C, and 
washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 
0.1% SDS for 1.5 hours at 60°C. 

Furthermore, the invention relates to a kit comprising in a container the isolated 
complex as described above or the antibody as described above, optionally together with 
further reagents and working instructions. The further reagents may be, for example, 
buffers, substrates for enzymes but also carrier material such as beads, filters, 
microarrays and other solid carriers. The working instructions may indicate how to use 
the ingredients of the kit in order to perform a desired assay. 

Furthermore, the invention relates to such kits for use in processing of RNA and 
for use in the diagnosis, prognosis and screening in or for the diseases mentioned 
above. 

The present invention further relates to a kit for processing RNA which kit 
comprises a product according to the invention. Such a kit may contain e.g. expression 
vectors encoding the essential components of the cleavage/polyadenylation machinery 
which components after being expressed can be reconstituted in order to form a 
biologically active cleavage/polyadenylation complex. Such a kit preferably also contains 
the required buffers and reagents together with the working instructions. 

The present invention further relates to a kit for the diagnosis of diseases of 
mammals which kit comprises a product according to the invention. As stated above the 
polyadenylation/cleavage machinery is involved in a large number of diseases. If said 
machinery activity is changed due to e.g. mutations of some components thereof and/or 
effectors, this may have severe implications for the affected organism. The kit according 
to the present invention containing the products according to the invention makes it possible to 
examine whether the cleavage/polyadenylation machinery in a given sample might 
show some defects. Such a kit may be used to determine genetic defects of the genes 
encoding the components of the cleavage/polyadenylation machinery. 

Furthermore, the invention relates to a complex as described above, or the 
antibody or fragment as described above, for use in a method of diagnosing a disease or 
disorder, preferably the diseases or disorders as mentioned above. 

Furthermore, the invention relates to a method for the production of a 
pharmaceutical composition comprising carrying out the method as described above to 
identify a molecule that modulates the function, activity or formation of said complex, and 
further comprising mixing the identified molecule with a pharmaceutically acceptable 
carrier. 

Furthermore, the invention relates to a process for preparing a complex as 
described above and optionally the components thereof comprising the following steps: 
expressing such a protein in a target cell, isolating the protein complex which is attached 
to the tagged protein, and optionally dissociating the protein complex and isolating the 
individual complex members. 

Furthermore, the invention relates to the process as described above 
characterized in that the tagged protein comprises two different tags which allow two 
separate affinity purification steps. 

Furthermore, the invention relates to the process as described above, 
characterized in that two tags are separated by a cleavage site for a protease. 

Furthermore, the invention relates to a component of the said complex obtainable 
by a process as described above. 

The present invention further relates to a composition, preferably a protein 
complex, which is obtainable by the method comprising the following steps: tagging a 
protein as defined above, i.e. a protein which forms part of a protein complex, with a 
moiety, preferably an amino acid sequence, that allows affinity purification of the tagged 
protein and expressing such protein in a target cell and isolating the protein complex 
which is attached to the tagged protein. The details of such purification are described in 
WO 00/09716 and in Rigaut, G. et al. (1999), Nature Biotechnology, Vol. 17 (10): 
1030-1032 and further herein below. The tagging can essentially be performed with any 
moiety which is capable of providing a specific interaction with a further moiety, e.g. in 
the sense of a ligand receptor interaction, antigen antibody interaction or the like. The 
tagged protein can also be expressed in the target cell in an amount which comes close 
to the physiological concentration, in order to avoid complex formation that is merely due to 
a high concentration of the expressed protein and does not reflect the naturally occurring 
complex. 

In a further preferred embodiment, the composition is obtained by using a 
tagged protein which comprises two different tags which allow two different affinity 
purification steps. This measure allows a higher degree of purification of the composition 
in question. In a further preferred embodiment the tagged protein comprises two tags 
that are separated by a cleavage site for a protease. This allows a step-by-step 
purification on affinity columns. 
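
The two-tag arrangement described above can be illustrated with a short sketch. It assumes, purely for illustration, that the protease is TEV protease, whose commonly cited recognition sequence is ENLYFQG with cleavage between Q and G; this passage of the specification does not name the protease. The bait and tag sequences, and the function names, are hypothetical placeholders and are not sequences of the invention.

```python
# Sketch only: models a bait protein fused to two affinity tags that are
# separated by a protease cleavage site. All sequences are hypothetical.
from typing import Tuple

TEV_SITE = "ENLYFQG"  # assumed protease site; cleavage occurs between Q and G


def build_fusion(bait: str, inner_tag: str, outer_tag: str) -> str:
    """Return bait + inner tag + protease site + outer tag as one string."""
    return bait + inner_tag + TEV_SITE + outer_tag


def cleave(fusion: str, site: str = TEV_SITE) -> Tuple[str, str]:
    """Split the fusion at the first protease site, before its last residue."""
    start = fusion.find(site)
    if start < 0:
        raise ValueError("no protease site found in fusion protein")
    cut = start + len(site) - 1  # position between the Q and the final G
    return fusion[:cut], fusion[cut:]


fusion = build_fusion(bait="MSTPLKV", inner_tag="KRRWKK", outer_tag="HHHHHH")
retained, released = cleave(fusion)
print(retained)  # bait plus inner tag, available for a second affinity step
print(released)  # fragment carrying the outer tag used in the first step
```

In this picture the outer tag serves the first affinity step, protease cleavage releases the bait together with the inner tag, and the inner tag serves the second affinity step.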

Furthermore, the invention relates to a complex as described above and/or 
protein thereof as a target for an active agent of a pharmaceutical, preferably a drug 
target in the treatment or prevention of disease or disorder, preferably diseases or 
disorders as mentioned above. 

Furthermore, the invention relates to the Ycl046w (SEQ ID: 59), Ygr156w (SEQ 
ID: 61), Yhl035c (SEQ ID: 63), Ykl018w (SEQ ID: 179), Ylr221c (SEQ ID: 67), Yml030w 
(SEQ ID: 69), and Yor179c (SEQ ID: 71) proteins, the mammalian homologs/orthologs of said 
proteins and functionally active fragments and derivatives of said proteins and the 
mammalian homologs thereof carrying one or more amino acid substitutions, deletions 
and/or additions and the nucleic acid encoding said proteins or said homologs, orthologs 
and functionally active fragments and derivatives thereof. 

Such a nucleic acid may be used for example to express a desired tagged protein 
in a given cell for the isolation of a complex or component according to the invention. 
Such a nucleic acid may also be used for the identification and isolation of genes from 
other organisms by cross species hybridization. 

The component according to the invention is preferably a protein component 
which can be further modified e.g. by carbohydrate residues. The components can either 
be prepared by recombinant DNA technology based on the sequences provided by the 
present invention or can be isolated from a biological source by using the process 
according to the invention. 

The present invention further relates to a fusion protein comprising a component 
according to the invention. The fusion part, which can be added to the N-terminal, the 
C-terminal or into the amino acid sequence of the component according to the invention, 
may comprise a few amino acids only, e.g. at least five, which amino acids for example 
provide an epitope which is then used as a target for affinity purification of the protein 
and the complex, respectively. Such a type of added amino acid is also termed "tag" 
throughout the present specification (optionally, the fusion protein may comprise even 
more than one such fusion partner). 

The present invention further relates to a construct, preferably a vector 
construct, which comprises a nucleic acid as described above. Such constructs may 
comprise expression controlling elements such as promoters, enhancers and terminators 
in order to express the nucleic acids in a given host cell, preferably under conditions 
which resemble the physiological concentrations. 

The present invention further relates to a construct which comprises the nucleic 
acid according to the invention and at least one further nucleic acid which is normally not 
associated with the nucleic acid according to the invention. Such a construct is preferably 
a vector which preferably is capable of replicating in a given cell and contains the 
necessary transcription control elements for expressing the nucleic acid according to the 
invention in a given expression system. Moreover, such vector construct may contain 
selection markers. 

The present invention further relates to a host cell containing a construct as 
defined above. 

Such a host cell can be, e.g., any eukaryotic cell such as yeast, plant or 
mammalian, whereas human cells are preferred. Such host cells may form the starting 
material for isolation of a complex according to the present invention. 

The present invention also relates to a host cell containing a nucleic acid 
according to the invention or a construct according to the invention. Such a host cell may 
contain an expression vector which encodes a component according to the invention 
which component may serve as a bait in order to isolate the further proteins of the 
complex and which at least partly interact with the bait. Host cells can be prokaryotic 
and eukaryotic cells, whereas mammalian host cells are preferred. 

Animal models and methods of screening for modulators (i.e., agonists and 
antagonists) of the amount of, activity of, or protein component composition of, a 
complex of the present invention are also provided. 

Below is a more detailed list of the newly identified components of the polyadenylation 
complex (see also Tab. 1). The Accession-Number stated is the GenBank-Accession 
number for the protein. 

Act1: Is a known and essential protein (GenBank Acc. No. BAA21512.1), which has been 
shown to be involved in Pol II transcription and has been found to be associated with 
histone acetylation. It serves as a structural protein. 

Cka1: Is a known and non-essential protein (GenBank Acc. No. CAA86916.1), which has 
been found to be involved in Polymerase III transcription and has been found to be 
associated with the Casein kinase II complex. 

Eft2: The translation elongation factor EF-2 is a known protein involved in protein 
synthesis (GenBank Acc. No. AAB64827.1). 

Eno2: Is a known and essential protein (GenBank Acc. No. AAB68019.1). It has been 
shown to have lyase activity and is known to be involved in carbohydrate metabolism. 

Glc7 (YER133w) is also a known protein (GenBank Acc. No. AAC03231.1). It is also an 
essential protein and is a Type I protein serine threonine phosphatase which has been 
implicated in distinct cellular roles, such as carbohydrate metabolism, meiosis, mitosis 
and cell polarity. Its occurrence in the cleavage/polyadenylation machinery has not been 
known before. 

Gpm1: This protein is a phosphoglycerate mutase that converts 2-phosphoglycerate to 
3-phosphoglycerate in glycolysis. It is an essential protein (GenBank Acc. No. CAA81994.1). 

Hhf2: Is a known and non-essential protein (GenBank Acc. No. CAA95892.1) which has 
been shown to be involved in DNA-binding. It has previously been linked to Histone 
octamer and the RNA polymerase I upstream activation factor. 

Hta1: Is a known and non-essential protein (GenBank Acc. No. CAA88505.1) which has 
DNA-binding capability and has been shown to be involved in polymerase II transcription. 

Hsc82: Is a non-essential protein so far associated with protein folding (GenBank 
Acc. No. CAA89919.1). 

Imd2: Is an inosine-5'-monophosphate dehydrogenase so far associated with 
nucleotide metabolism. It is non-essential (GenBank Acc. No. AAB69728.1). 

Imd4: Is a non-essential protein with similarity to Imd2, so far associated with 
nucleotide metabolism (GenBank Acc. No. CAA86719.1). 

Met6: Is a homocysteine methyltransferase so far associated with amino acid 
metabolism (GenBank Acc. No. AAB64646.1). 

Pdc1: Is a pyruvate decarboxylase isozyme 1 so far associated with carbohydrate 
metabolism (GenBank Acc. No. CAA97573.1). 

Pfk1: Is a known protein (GenBank Acc. No. CAA97268.1) which has previously been 
described as part of the phosphofructokinase complex. 

Ref2 (YDR195w) is a known protein (GenBank Acc. No. CAA88708.1). It is a 
non-essential gene product. It has been shown to be involved in 3'-end formation prior to the 
final polyadenylation step. However, Ref2 has never been identified before as a 
component of the 3'-end processing machinery. Ref2 has been shown to interact with 
Glc7, another new component of the cleavage/polyadenylation machinery. 

Sec13: Is a known and essential protein (GenBank Acc. No. AAB67426.1). 

Sec31: Is a known and essential protein (GenBank Acc. No. CAA98772.1). 

Ssa3: Is a known and non-essential protein (GenBank Acc. No. CAA84896.1) which so 
far has been implicated with protein folding/protein transport. 

Ssu72 (YNL222w) is also a known protein (GenBank Acc. No. CAA96125.1) and is an 
essential phylogenetically conserved protein which has been shown to interact with the 
general transcription factor TFIIB (Sua7). TFIIB is an essential component of the RNA 
polymerase II (RNAP II) core transcriptional machinery. It is thought that this interaction 
plays a role in the mechanism of start site selection by RNAP II. The finding according to 
the present invention that Ssu72 is associated with Pta1 is likely to be relevant since it is 
believed that mRNA 3'-end formation is linked with other nuclear processes like 
transcription, capping and splicing. Furthermore, Ssu72 has also been clearly identified 
in a "reverse tagging experiment", as explained herein below, by using some of the 
Pta1-associated proteins as bait. However, when Ssu72 itself was used as a bait, associated 
proteins were not found, most likely due to the fact that the addition of a C-terminal tag 
renders Ssu72 non-functional. 

Taf60: Is a known and essential protein (GenBank Acc. No. CAA96819.1) which has 
been shown to be involved in Polymerase II transcription. 

Tkl1: Is a non-essential transketolase so far associated with amino acid 
metabolism and carbohydrate metabolism (GenBank Acc. No. CAA89191.1). 

Tsai: Translation initiation factor eIF5, which so far has been shown to catalyze 
hydrolysis of GTP on the 40S ribosomal subunit-initiation complex followed by joining to 
the 60S ribosomal subunit (GenBank Acc. No. CAA92145.1). 

Tye7: Is a known protein (GenBank Acc. No. CAA99671.1). It has been shown to be a 
basic helix-loop-helix transcription factor. 

Vid24: Is a known and non-essential protein (GenBank Acc. No. CAA89320.1) which has 
previously been associated with protein degradation and vesicular transport. 

Vps53: Is a known protein (GenBank Acc. No. CAA89320.1) which has been found to 
play a role in protein sorting. 

YCL046w: Is a non-essential protein (GenBank Acc. No. CAA42371.1). 

YGR156W is the protein product of an essential gene. This protein also contains an 
RNA-binding motif (GenBank Acc. No. CAA97170.1). 

YHL035c: Is a known and non-essential protein (GenBank Acc. No. AAB65047.1). It is a 
member of the ATP-binding cassette superfamily. 

YKL018W is also an essential protein containing a WD40 domain, which is a typical 
protein binding domain (GenBank Acc. No. CAA81853.1). 

YLR221c: Is a protein of unknown function (GenBank Acc. No. AAB67410.1). 

YML030w: Is a protein of unknown function (GenBank Acc. No. CAA86625.1). 

YOR179c shows significant sequence similarity to Ysh1 (GenBank Acc. No. 
CAA99388.1). 

Two further proteins, for which binary interactions with previously known members of the 
polyadenylation complex have been shown before, have also been purified with the 
complex: 

YKL059c: Is the product of an essential gene and is a zinc binding protein containing a 
C2HC zinc finger. The presence of this domain predicts an RNA binding function of 
YKL059c. We believe the corresponding gene product is identical to Pfs1, a protein 
which has been mentioned in several publications, but which has never been annotated 
in the databases (for review see Keller, W. and Minvielle-Sebastia (1997). Curr Opin Cell 
Biol 11: 352-357) (GenBank Acc. No. CAA81896.1). 

Tif4632: Is a known and non-essential protein (GenBank Acc. No. CAA96751.1) which 
has been shown to have an RNA-binding/translation factor activity and is involved in 
protein synthesis. 



TABLES: 

Table 1 : Composition of the Complex (Cleavage/polyadenylation machinery): 

First column ('Entry point') lists the bait proteins (TAP-tag fusion proteins) that have 
been chosen for the isolation of the given complex. Note: in several cases, different baits 
have been used for validation in reverse tagging experiments. 

Second column ('Interactions') briefly lists any known interactions between different 
members of the complex (Abbreviations: '2-hybrid': interaction as identified in 
yeast-2-hybrid screens; 'far-western': interaction as identified in far-western experiments; 
'coipp': interaction as identified by co-immunoprecipitation experiments; 'high-throughput 
2 hybrid': interaction as identified by high-throughput yeast-2-hybrid screens; 
'copurification': interaction as identified by copurification experiments; 
'immuno-affinity-columns': interaction as identified in experiments using immuno-affinity 
columns; 'in vitro binding': interaction as identified in in-vitro-binding experiments). If a 
core complex has been known previously containing several of the identified proteins, the 
name of the complex is stated. 

Third column ('Proteins found') lists all proteins which have been identified in the 
particular complex. 

Fourth column ('COLUMN A', 'Known components of the complex') lists the components 
of the complex as found by Cellzome, which have been known to interact with other 
members of the complex as identified herein (see also third column). 

Fifth column ('COLUMN B', 'Novel proteins') lists the novel members of the complex as 
provided in the invention. 

Sixth column ('Column C', 'cleavage/polyadenylation machinery') lists again all 
components of the cleavage/polyadenylation machinery as identified herein. 

Seventh column ('COLUMN C', 'Activity of the complex') lists the biochemical activities of 
the newly identified complex. 

Eighth column ('COLUMN D', 'Proteins of unknown function') separately lists again the 
members of the newly identified complex which previously have not been annotated. 

Ninth column ('localization') indicates the localization of the identified complex 
(Abbreviations: c: cytoplasm; b: membrane; e: ER/Golgi/vesicles; m: mitochondria; n: 
nucleus; u: unknown). 



Table 2: Individual Yeast Proteins of the Complexes 

A) Table lists in alphabetical order all yeast proteins which have been identified as 
members of the complex presented herein. Furthermore, the SEQ IDs of the proteins are 
listed as used herein. Further columns list the accession numbers of the respective 
sequences in MIPS, SWISS-PROT, SGD and GenBank. In addition, where applicable, 
the GenBank accession numbers of the respective orthologues in humans, C. elegans 
and Drosophila are listed. 

B) Table lists again the proteins and SEQ IDs as in part A. In addition, the table contains 
an overview of what has been previously reported on the protein, the biochemical 
function thereof and the cellular function thereof as stated in YPD (Costanzo, M.C. et 
al., 2001, Nucl. Acids Res. 29: 75-9; Hodges, P.E. et al., 1999, Nucl. Acids Res. 27: 
69-73). 

Table 3: Medical Application of the Complex: 

First column ('Name of complex') lists again the name of the complex as used herein. 
Second column ('Cellular role') lists keywords on the cellular role of the complex. 
Third column ('Medical applications') lists disorders, diseases, disease areas etc. which 
are treatable and/or preventable and/or diagnosable etc. by therapeutics and methods 
interacting with/acting via the complex. 

Table 4: Characterization of previously undescribed individual proteins of the complexes: 
The table provides data on proteins which have not been annotated previously but which 
have now been linked to a functional complex as described in table 2. Names are listed 
on the left. In addition, the table contains a list of motifs found by sequence analysis, 
which has been part of the invention provided herein. Furthermore, the predicted known 
human orthologues are listed on the right (by SWISS-PROT accession numbers). The 
abbreviations used are listed at the end of the table. The function of the individual proteins as 
deduced from the association with the complex, the sequence analysis and the analysis 
of the predicted orthologues is listed in the second column ('Putative function'). 

Table 5: Overview on Experimental Steps: The table illustrates the construction of a 
yeast strain expressing a TAP-tagged bait in a high-throughput fashion. 

Table 6: Known and Novel Components of the yeast mRNA 3'-end processing 
machinery (the cleavage/polyadenylation complex): Top part of the table states the 
different known subcomponents of the polyadenylation complex, the function thereof, the 
proteins constituting the different subcomplexes as known so far (including their 
molecular weight and sequence motifs contained in the protein). Bottom part lists the 
novel components of the complex as provided herein. 



5.1. PROTEIN COMPLEXES 

The protein complexes of the present invention and their component proteins are 
described in Tables 1, 2, 3, 4 and 6 (whereas Table 5 gives an overview on the 
construction of the yeast strains). The protein complexes and component proteins can 
be obtained by methods well known in the art for protein purification and recombinant 
protein expression. For example, the protein complexes of the present invention can be 
isolated using the TAP method described in Section 6, infra, and in WO 00/09716 and 
Rigaut et al., 1999, Nature Biotechnology 17:1030-1032, which are each incorporated by 
reference in their entirety. Additionally, the protein complexes can be isolated by 
immunoprecipitation of the component proteins and combining the immunoprecipitated 
proteins. The protein complexes can also be produced by recombinantly expressing the 
component proteins and combining the expressed proteins. 

The nucleic and amino acid sequences of the component proteins of the protein 
complexes of the present invention are provided herein (SEQ ID NOS:1-2670), and can 
be obtained by any method known in the art, e.g., by PCR amplification using synthetic 
primers hybridizable to the 3' and 5' ends of each sequence, and/or by cloning from a 
cDNA or genomic library using an oligonucleotide specific for each nucleotide sequence. 
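
As a rough illustration of the primer-based approach above, the following sketch derives a primer pair from the two ends of a target sequence. The function names, the fixed primer length and the example sequence are hypothetical; practical primer design would also weigh melting temperature, GC content and secondary structure, none of which is modelled here.

```python
# Sketch only: derive PCR primers from the 5' and 3' ends of a target sequence.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def design_primers(target: str, length: int = 20):
    """Return (forward, reverse) primers, both written 5'->3'."""
    target = target.upper()
    if length > len(target):
        raise ValueError("primer length exceeds target length")
    forward = target[:length]                       # identical to the 5' end
    reverse = reverse_complement(target[-length:])  # anneals at the 3' end
    return forward, reverse


# Hypothetical 21-nt sequence, not one of the SEQ ID entries.
fwd, rev = design_primers("ATGGCCAAGTTTCCCGGGTAA", length=6)
print(fwd, rev)
```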

Homologs (e.g., nucleic acids encoding component proteins from other species) 
or other related sequences (e.g., variants, paralogs) which are members of a native 
cellular protein complex can be obtained by low, moderate or high stringency 
hybridization with all or a portion of the particular nucleic acid sequence as a probe, 
using methods well known in the art for nucleic acid hybridization and cloning. 

Exemplary moderately stringent hybridization conditions are as follows: 
prehybridization of filters containing DNA is carried out for 8 hours to overnight at 65 °C 
in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 
0.02% Ficoll, 0.02% BSA, and 500 µg/ml denatured salmon sperm DNA. Filters are 
hybridized for 48 hours at 65 °C in prehybridization mixture containing 100 µg/ml 
denatured salmon sperm DNA and 5-20 x 10^6 cpm of 32P-labeled probe. Washing of 
filters is done at 37 °C for 1 hour in a solution containing 2X SSC, 0.01% PVP, 0.01% 
Ficoll, and 0.01% BSA. This is followed by a wash in 0.1X SSC at 50 °C for 45 min 
before autoradiography. Alternatively, exemplary conditions of high stringency are as 
follows: e.g., hybridization to filter-bound DNA in 0.5 M NaHPO4, 7% sodium dodecyl 
sulfate (SDS), 1 mM EDTA at 65 °C, and washing in 0.1X SSC/0.1% SDS at 68 °C 
(Ausubel F.M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green 
Publishing Associates, Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3). Other 
conditions of high stringency which may be used are well known in the art. Exemplary 
low stringency hybridization conditions comprise hybridization in a buffer comprising 35% 
formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 
0.2% BSA, 100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate 
for 18-20 hours at 40 °C, washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 
7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55 °C, and washing in a buffer 
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 
hours at 60 °C. 

For recombinant expression of one or more of the proteins, the nucleic acid 
containing all or a portion of the nucleotide sequence encoding the protein can be 
inserted into an appropriate expression vector, i.e., a vector that contains the necessary 
elements for the transcription and translation of the inserted protein coding sequence. 
The necessary transcriptional and translational signals can also be supplied by the native 
promoter of the component protein gene, and/or flanking regions. 

A variety of host-vector systems may be utilized to express the protein coding 
sequence. These include but are not limited to mammalian cell systems infected with 
virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g., 
baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host- 
vector system utilized, any one of a number of suitable transcription and translation 
elements may be used. 

In a preferred embodiment, a complex of the present invention is obtained by 
expressing the entire coding sequences of the component proteins in the same cell, 
either under the control of the same promoter or separate promoters. In yet another 
embodiment, a derivative, fragment or homolog of a component protein is recombinantly 
expressed. Preferably the derivative, fragment or homolog of the protein forms a 
complex with the other components of the complex, and more preferably forms a 
complex that binds to an anti-complex antibody. 

The present invention further relates to an antibody which reacts with a product 
according to the invention. Such an antibody might be used e.g. during purification of the 
machinery from a given source by affinity purification methods. Moreover, the antibody 
might be used in diagnosis in order to detect changes and/or modifications of a product 
according to the invention in a given sample. 

Any method available in the art can be used for the insertion of DNA fragments 
into a vector to construct expression vectors containing a chimeric gene consisting of 
appropriate transcriptional/translational control signals and protein coding sequences. 
These methods may include in vitro recombinant DNA and synthetic techniques and in 
vivo recombinant techniques (genetic recombination). Expression of nucleic acid 
sequences encoding a component protein, or a derivative, fragment or homolog thereof, 
may be regulated by a second nucleic acid sequence so that the gene or fragment 
thereof is expressed in a host transformed with the recombinant DNA molecule(s). For 
example, expression of the proteins may be controlled by any promoter/enhancer known 
in the art. In a specific embodiment, the promoter is not native to the gene for the 
component protein. Promoters that may be used can be selected from among the many 
known in the art, and are chosen so as to be operative in the selected host cell. 

In a specific embodiment, a vector is used that comprises a promoter operably 
linked to nucleic acid sequences encoding a component protein, or a fragment, derivative 
or homolog thereof, one or more origins of replication, and optionally, one or more 
selectable markers (e.g., an antibiotic resistance gene). 

In another specific embodiment, an expression vector containing the coding 
sequence, or a portion thereof, of a component protein, either together or separately, is 
made by subcloning the gene sequences into the EcoRI restriction site of each of the 
three pGEX vectors (glutathione S-transferase expression vectors; Smith and Johnson, 
1988, Gene 67:31-40). This allows for the expression of products in the correct reading 
frame. 

Expression vectors containing the sequences of interest can be identified by three 
general approaches: (a) nucleic acid hybridization, (b) presence or absence of "marker" 
gene function, and (c) expression of the inserted sequences. In the first approach, 
coding sequences can be detected by nucleic acid hybridization to probes comprising 
sequences homologous and complementary to the inserted sequences. In the second 
approach, the recombinant vector/host system can be identified and selected based 
upon the presence or absence of certain "marker" functions (e.g., resistance to 
antibiotics, occlusion body formation in baculovirus, etc.) caused by insertion of the 
sequences of interest in the vector. For example, if a component protein gene, or portion 
thereof, is inserted within the marker gene sequence of the vector, recombinants 
containing the encoded protein or portion will be identified by the absence of the marker 
gene function (e.g., loss of beta-galactosidase activity). In the third approach, 
recombinant expression vectors can be identified by assaying for the component protein 
expressed by the recombinant vector. Such assays can be based, for example, on the 
physical or functional properties of the interacting species in in vitro assay systems, e.g., 
formation of a complex comprising the protein or binding to an anti-complex antibody. 

Once recombinant component protein molecules are identified and the complexes 
or individual proteins isolated, several methods known in the art can be used to 
propagate them. Using a suitable host system and growth conditions, recombinant 
expression vectors can be propagated and amplified in quantity. As previously 
described, the expression vectors or derivatives which can be used include, but are not 
limited to, human or animal viruses such as vaccinia virus or adenovirus; insect viruses 
such as baculovirus; yeast vectors; bacteriophage vectors such as lambda phage; and 
plasmid and cosmid vectors. 

In addition, a host cell strain may be chosen that modulates the expression of the 
inserted sequences, or modifies or processes the expressed proteins in the specific 
fashion desired. Expression from certain promoters can be elevated in the presence of 
certain inducers; thus expression of the genetically-engineered component proteins may 
be controlled. Furthermore, different host cells have characteristic and specific 
mechanisms for the translational and post-translational processing and modification 
(e.g., glycosylation, phosphorylation, etc.) of proteins. Appropriate cell lines or host 
systems can be chosen to ensure that the desired modification and processing of the 
foreign protein is achieved. For example, expression in a bacterial system can be used 
to produce an unglycosylated core protein, while expression in mammalian cells ensures 
"native" glycosylation of a heterologous protein. Furthermore, different vector/host 
expression systems may effect processing reactions to different extents. 

In other specific embodiments, a component protein or a fragment, homolog or 
derivative thereof, may be expressed as a fusion or chimeric protein product comprising 
the protein, fragment, homolog, or derivative joined via a peptide bond to a heterologous 
protein sequence of a different protein. Such chimeric products can be made by ligating 
the appropriate nucleic acid sequences encoding the desired amino acids to each other 
by methods known in the art, in the proper coding frame, and expressing the chimeric 
products in a suitable host by methods commonly known in the art. Alternatively, such a 
chimeric product can be made by protein synthetic techniques, e.g., by use of a peptide 
synthesizer. Chimeric genes comprising a portion of a component protein fused to any 
heterologous protein-encoding sequences may be constructed. 

In particular, protein component derivatives can be made by altering their 
sequences by substitutions, additions or deletions that provide for functionally equivalent 
molecules. Due to the degeneracy of nucleotide coding sequences, other DNA 
sequences that encode substantially the same amino acid sequence as a component 
gene or cDNA can be used in the practice of the present invention. These include but 
are not limited to nucleotide sequences comprising all or portions of the component 
protein gene that are altered by the substitution of different codons that encode a 
functionally equivalent amino acid residue within the sequence, thus producing a silent 
change. Likewise, the derivatives of the invention include, but are not limited to, those 
containing, as a primary amino acid sequence, all or part of the amino acid sequence of 
a component protein, including altered sequences in which functionally equivalent amino 
acid residues are substituted for residues within the sequence resulting in a silent 
change. For example, one or more amino acid residues within the sequence can be 
substituted by another amino acid of a similar polarity that acts as a functional 
equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid 
belongs. For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan and methionine. The polar neutral 
amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids include arginine, lysine and 
histidine. The negatively charged (acidic) amino acids include aspartic acid and glutamic 
acid. 
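
The class-based substitution rule described above can be illustrated with a short sketch. The class membership is taken from the text; the one-letter codes and the function names (residue_class, is_conservative) are illustrative assumptions and are not part of the specification.

```python
# Amino acid classes as listed in the text, written with one-letter codes.
AMINO_ACID_CLASSES = {
    "nonpolar": set("ALIVPFWM"),      # Ala, Leu, Ile, Val, Pro, Phe, Trp, Met
    "polar_neutral": set("GSTCYNQ"),  # Gly, Ser, Thr, Cys, Tyr, Asn, Gln
    "basic": set("RKH"),              # Arg, Lys, His
    "acidic": set("DE"),              # Asp, Glu
}


def residue_class(residue: str) -> str:
    """Return the name of the class that contains the given residue."""
    residue = residue.upper()
    for name, members in AMINO_ACID_CLASSES.items():
        if residue in members:
            return name
    raise ValueError(f"unknown amino acid: {residue!r}")


def is_conservative(original: str, substitute: str) -> bool:
    """True if both residues belong to the same class."""
    return residue_class(original) == residue_class(substitute)
```

Under this sketch, replacing leucine with isoleucine is a within-class substitution, whereas replacing aspartic acid with lysine is not. Whether a given within-class substitution actually preserves function would still have to be established experimentally.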

In a specific embodiment, up to 1%, 2%, 5%, 10%, 15% or 20% of the total 
number of amino acids in the wild type protein are substituted or deleted; or 1, 2, 3, 4, 5, 
or 6 amino acids are inserted, substituted or deleted relative to the wild type protein. 

In a specific embodiment of the invention, nucleic acids encoding a protein 
component, and protein components consisting of or comprising a fragment of 
at least 6 (continuous) amino acids of the protein, are provided. In other 
embodiments, the fragment consists of at least 10, 20, 30, 40, or 50 amino acids of the 
component protein. In specific embodiments, such fragments are not larger than 35, 100 
or 200 amino acids. Derivatives or analogs of component proteins include, but are not 
limited to, molecules comprising regions that are substantially homologous to the 
component proteins, in various embodiments, by at least 30%, 40%, 50%, 60%, 70%, 
80%, 90% or 95% identity over an amino acid sequence of identical size or when 
compared to an aligned sequence in which the alignment is done by a computer 
homology program known in the art, or whose encoding nucleic acid is capable of 
hybridizing to a sequence encoding the component protein under stringent, moderately 
stringent, or nonstringent conditions. 
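
As an illustration of the identity thresholds recited above, the following sketch computes percent identity over two sequences that have already been aligned. It assumes that gaps are written as "-" and that the denominator is the full alignment length; homology programs differ on this point, so the choice is an assumption of the sketch, as are the function names.

```python
# Identity thresholds recited in the text, in percent.
IDENTITY_THRESHOLDS = (30, 40, 50, 60, 70, 80, 90, 95)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Assumption: both inputs are rows of one alignment (equal length,
    gaps as '-'), and the denominator is the alignment length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for a, b in pairs if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity: float):
    """Return the largest recited threshold met, or None if below 30%."""
    met = [t for t in IDENTITY_THRESHOLDS if identity >= t]
    return max(met) if met else None
```

For example, two aligned eight-residue sequences that differ at one position are 87.5% identical, which meets the 80% threshold but not the 90% threshold.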

The protein component derivatives and analogs of the invention can be produced 
by various methods known in the art. The manipulations which result in their production 
can occur at the gene or protein level. For example, the cloned gene sequences can be 
modified by any of numerous strategies known in the art (Sambrook et al., 1989, 
Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York). The sequences can be cleaved at appropriate sites with 
restriction endonuclease(s), followed by further enzymatic modification if desired, 
isolated, and ligated in vitro. In the production of the gene encoding a derivative, 
homolog or analog of a component protein, care should be taken to ensure that the 
modified gene retains the original translational reading frame, uninterrupted by 
translational stop signals, in the gene region where the desired activity is encoded. 

Additionally, the encoding nucleic acid sequence can be mutated in vitro or in 
vivo, to create and/or destroy translation initiation and/or termination sequences, or to 
create variations in coding regions and/or form new restriction endonuclease sites or 
destroy pre-existing ones, to facilitate further in vitro modification. Any technique for 
mutagenesis known in the art can be used, including but not limited to, chemical 
mutagenesis and in vitro site-directed mutagenesis (Hutchinson et al., 1978, J. Biol. 
Chem. 253:6551-6558), amplification with PCR primers containing a mutation, etc. 

Once a recombinant cell expressing a component protein, or fragment or 
derivative thereof, is identified, the individual gene product or complex can be isolated 
and analyzed. This is achieved by assays based on the physical and/or functional 
properties of the protein or complex, including, but not limited to, radioactive labeling of 
the product followed by analysis by gel electrophoresis, immunoassay, cross-linking to 
marker-labeled product, etc. 

The component proteins and complexes may be isolated and purified by standard 
methods known in the art (either from natural sources or recombinant host cells 
expressing the complexes or proteins), including but not restricted to column 
chromatography (e.g., ion exchange, affinity, gel exclusion, reversed-phase high 
pressure, fast protein liquid, etc.), differential centrifugation, differential solubility, or by 
any other standard technique used for the purification of proteins. Functional properties 
may be evaluated using any suitable assay known in the art. 

Alternatively, once a component protein or its derivative, is identified, the amino 
acid sequence of the protein can be deduced from the nucleic acid sequence of the 
chimeric gene from which it was encoded. As a result, the protein or its derivative can be 
synthesized by standard chemical methods known in the art (e.g., Hunkapiller et al., 
1984, Nature 310:105-111). 

Manipulations of component protein sequences may be made at the protein level. 
Included within the scope of the invention is a complex in which the component proteins 
or derivatives and analogs are differentially modified during or after translation, e.g., 
by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other 
cellular ligand, etc. Any of numerous chemical modifications may be carried out by 
known techniques, including but not limited to specific chemical cleavage by cyanogen 
bromide, trypsin, chymotrypsin, papain, V8 protease, NaBH4, acetylation, formylation, 
oxidation, reduction, metabolic synthesis in the presence of tunicamycin, etc. 

In specific embodiments, the amino acid sequences are modified to include a 
fluorescent label. In another specific embodiment, the protein sequences are modified to 
have a heterofunctional reagent; such heterofunctional reagents can be used to crosslink 
the members of the complex. 

In addition, complexes of analogs and derivatives of component proteins can be 
chemically synthesized. For example, a peptide corresponding to a portion of a 
component protein, which comprises the desired domain or mediates the desired activity 
in vitro (e.g., complex formation) can be synthesized by use of a peptide synthesizer. 
Furthermore, if desired, non-classical amino acids or chemical amino acid analogs can 
be introduced as a substitution or addition into the protein sequence. 

In cases where natural products are suspected of being mutant or are isolated 
from new species, the amino acid sequence of a component protein isolated from the 
natural source, as well as those expressed in vitro, or from synthesized expression 
vectors in vivo or in vitro, can be determined from analysis of the DNA sequence, or 
alternatively, by direct sequencing of the isolated protein. Such analysis can be 
performed by manual sequencing or through use of an automated amino acid 
sequenator. 

The complexes can also be analyzed by hydrophilicity analysis (Hopp and 
Woods, 1981, Proc. Natl. Acad. Sci. USA 78:3824-3828). A hydrophilicity profile can be 
used to identify the hydrophobic and hydrophilic regions of the proteins, and help predict 
their orientation in designing substrates for experimental manipulation, such as in binding 
experiments, antibody synthesis, etc. Secondary structural analysis can also be done to 
identify regions of the component proteins, or their derivatives, that assume specific 
structures (Chou and Fasman, 1974, Biochemistry 13:222-23). Manipulation, translation, 
secondary structure prediction, hydrophilicity and hydrophobicity profile predictions, open 
reading frame prediction and plotting, and determination of sequence homologies, etc., 
can be accomplished using computer software programs available in the art. 
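
A hydrophilicity profile of the kind referred to above can be sketched as a sliding-window average over per-residue values. The values below are the Hopp and Woods hydrophilicity values as commonly tabulated and should be checked against the cited publication before use; the window of six residues and the function name are assumptions of this sketch.

```python
# Hopp-Woods hydrophilicity values as commonly tabulated (one-letter codes).
# Assumption: verify against the cited 1981 publication before relying on them.
HOPP_WOODS = {
    "R": 3.0, "D": 3.0, "E": 3.0, "K": 3.0, "S": 0.3,
    "N": 0.2, "Q": 0.2, "G": 0.0, "P": 0.0, "T": -0.4,
    "A": -0.5, "H": -0.5, "C": -1.0, "M": -1.3, "V": -1.5,
    "I": -1.8, "L": -1.8, "Y": -2.3, "F": -2.5, "W": -3.4,
}


def hydrophilicity_profile(sequence: str, window: int = 6) -> list:
    """Average hydrophilicity of every window of consecutive residues.

    Element i of the result covers residues i .. i + window - 1.
    Higher values indicate more hydrophilic stretches.
    A residue missing from the table raises KeyError.
    """
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [HOPP_WOODS[residue] for residue in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]
```

Windows with the highest averages mark hydrophilic stretches that are more likely to be surface exposed, which is why such profiles are consulted when choosing peptides for antibody production.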

Other methods of structural analysis including but not limited to X-ray 
crystallography (Engstrom, 1974, Biochem. Exp. Biol. 11:7-13), mass spectroscopy and 
gas chromatography (Methods in Protein Science, J. Wiley and Sons, New York, 1997), 
and computer modeling (Fletterick and Zoller, eds., 1986, Computer Graphics and 
Molecular Modeling, In: Current Communications in Molecular Biology, Cold Spring 
Harbor Laboratory, Cold Spring Harbor Press, New York) can also be employed. 

5.2. ANTIBODIES TO PROTEIN COMPLEXES 

According to the present invention, a protein complex of the present invention 
comprising a first protein, or a functionally active fragment or functionally active 
derivative thereof, selected from the group consisting of proteins listed in column A of 
table 1; and a second protein, or a functionally active fragment or functionally active 
derivative thereof, selected from the group consisting of proteins listed in column B of 
table 1, or a functionally active fragment or functionally active derivative thereof, can be 
used as an immunogen to generate antibodies which immunospecifically bind such 
immunogen. According to the present invention, a protein complex of the present 
invention comprising all proteins listed in column C of table 1 can also be used as an 
immunogen to generate antibodies which immunospecifically bind to such immunogen. 

Such antibodies include, but are not limited to, polyclonal, monoclonal, 
chimeric, single chain, Fab fragments, and an Fab expression library. In a specific 
embodiment, antibodies to a complex comprising human protein components are 
produced. In another embodiment, a complex formed from a fragment of said first 
protein and a fragment of said second protein, which fragments contain the protein 
domain that interacts with the other member of the complex, is used as an immunogen 
for antibody production. In a preferred embodiment, the antibody is specific for the 
complex in that the antibody does not bind the individual protein components of the 
complex. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 
polyclonal antibody compositions are ones that have been selected for antibodies 
directed against a polypeptide or polypeptides of the invention. Particularly preferred 
polyclonal antibody preparations are ones that contain only antibodies directed against a 
polypeptide or polypeptides of the invention. Particularly preferred immunogen 
compositions are those that contain no other human proteins such as, for example, 
immunogen compositions made using a non-human host cell for recombinant expression 
of a polypeptide of the invention. In such a manner, the only human epitope or epitopes 
recognized by the resulting antibody compositions raised against this immunogen will be 
present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in the immunized subject can be monitored over time by 
standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) 
using immobilized polypeptide. If desired, the antibody molecules can be isolated from 
the mammal (e.g., from the blood) and further purified by well-known techniques, such as 
protein A chromatography to obtain the IgG fraction. Alternatively, antibodies specific for 
a protein or polypeptide of the invention can be selected for (e.g., partially purified) or 
purified by, e.g., affinity chromatography. For example, a recombinantly expressed and 
purified (or partially purified) protein of the invention is produced as described herein, 
and covalently or non-covalently coupled to a solid support such as, for example, a 
chromatography column. The column can then be used to affinity purify antibodies 
specific for the proteins of the invention from a sample containing antibodies directed 
against a large number of different epitopes, thereby generating a substantially purified 
antibody composition, i.e., one that is substantially free of contaminating antibodies. By 
a substantially purified antibody composition is meant, in this context, that the antibody 
sample contains at most only 30% (by dry weight) of contaminating antibodies directed 
against epitopes other than those on the desired protein or polypeptide of the invention, 
and preferably at most 20%, yet more preferably at most 10%, and most preferably at 
most 5% (by dry weight) of the sample is contaminating antibodies. A purified antibody 
composition means that at least 99% of the antibodies in the composition are directed 
against the desired protein or polypeptide of the invention. 
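
The purity definitions above can be restated as a simple classification. This is a sketch only: it treats the dry-weight fraction of contaminating antibodies and the fraction of antibodies not directed against the desired protein as a single number, which is a simplifying assumption, and the function name and returned labels are illustrative.

```python
def classify_antibody_composition(contaminant_fraction: float) -> str:
    """Classify a composition by its fraction of contaminating antibodies.

    contaminant_fraction is a value from 0.0 to 1.0. The cut-offs follow
    the text: "purified" means at least 99% on target (at most 1%
    contaminants); "substantially purified" means at most 30% contaminants.
    """
    if not 0.0 <= contaminant_fraction <= 1.0:
        raise ValueError("contaminant_fraction must be between 0.0 and 1.0")
    if contaminant_fraction <= 0.01:
        return "purified"
    if contaminant_fraction <= 0.30:
        return "substantially purified"
    return "not substantially purified"
```

The preferred limits of 20%, 10% and 5% recited in the text would be further cut-offs inside the "substantially purified" range.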

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein, 1975, Nature 256:495-497, the 
human B cell hybridoma technique (Kozbor et al., 1983, Immunol. Today 4:72), the 
EBV-hybridoma technique (Cole et al., 1985, Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for 
producing hybridomas is well known (see generally Current Protocols in Immunology 
(1994) Coligan et al. (eds.) John Wiley & Sons, Inc., New York, NY). Hybridoma cells 
producing a monoclonal antibody of the invention are detected by screening the 
hybridoma culture supernatants for antibodies that bind the polypeptide of interest, e.g., 
using a standard ELISA assay. 

As an alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal 
antibody directed against a polypeptide of the invention can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody phage 
display library) with the polypeptide of interest. Kits for generating and screening phage 
display libraries are commercially available (e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP Phage Display 
Kit, Catalog No. 240612). Additionally, examples of methods and reagents particularly 
amenable for use in generating and screening an antibody display library can be found in, 
for example, U.S. Patent No. 5,223,409; PCT Publication No. WO 92/18619; PCT 
Publication No. WO 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. 
WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 92/01047; 
PCT Publication No. WO 92/09690; PCT Publication No. WO 90/02809; Fuchs et al., 
1991, Bio/Technology 9:1370-1372; Hay et al., 1992, Hum. Antibod. Hybridomas 
3:81-85; Huse et al., 1989, Science 246:1275-1281; Griffiths et al., 1993, EMBO J. 
12:725-734. 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 
made using standard recombinant DNA techniques, are within the scope of the invention. 
A chimeric antibody is a molecule in which different portions are derived from different 
animal species, such as those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. (See, e.g., Cabilly et al., U.S. Patent No. 
4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are incorporated herein by 
reference in their entirety.) Humanized antibodies are antibody molecules from non-human 
species having one or more complementarity determining regions (CDRs) from 
the non-human species and a framework region from a human immunoglobulin 
molecule. (See, e.g., Queen, U.S. Patent No. 5,585,089, which is incorporated herein by 
reference in its entirety.) Such chimeric and humanized monoclonal antibodies can be 
produced by recombinant DNA techniques known in the art, for example using methods 
described in PCT Publication No. WO 87/02671; European Patent Application 184,187; 
European Patent Application 171,496; European Patent Application 173,494; PCT 
Publication No. WO 86/01533; U.S. Patent No. 4,816,567; European Patent Application 
125,023; Better et al., 1988, Science 240:1041-1043; Liu et al., 1987, Proc. Natl. Acad. 
Sci. USA 84:3439-3443; Liu et al., 1987, J. Immunol. 139:3521-3526; Sun et al., 1987, 
Proc. Natl. Acad. Sci. USA 84:214-218; Nishimura et al., 1987, Canc. Res. 47:999-1005; 
Wood et al., 1985, Nature 314:446-449; Shaw et al., 1988, J. Natl. Cancer Inst. 
80:1553-1559; Morrison, 1985, Science 229:1202-1207; Oi et al., 1986, Bio/Techniques 
4:214; U.S. Patent 5,225,539; Jones et al., 1986, Nature 321:522-525; Verhoeyen et al., 
1988, Science 239:1534; and Beidler et al., 1988, J. Immunol. 141:4053-4060. 

Completely human antibodies are particularly desirable for therapeutic treatment 
of human patients. Such antibodies can be produced, for example, using transgenic 
mice which are incapable of expressing endogenous immunoglobulin heavy and light 
chain genes, but which can express human heavy and light chain genes. The 
transgenic mice are immunized in the normal fashion with a selected antigen, e.g., all or 
a portion of a polypeptide of the invention. Monoclonal antibodies directed against the 
antigen can be obtained using conventional hybridoma technology. The human 
immunoglobulin transgenes harbored by the transgenic mice rearrange during B cell 
differentiation, and subsequently undergo class switching and somatic mutation. Thus, 
using such a technique, it is possible to produce therapeutically useful IgG, IgA and IgE 
antibodies. For an overview of this technology for producing human antibodies, see 
Lonberg and Huszar, 1995, Int. Rev. Immunol. 13:65-93. For a detailed discussion of 
this technology for producing human antibodies and human monoclonal antibodies and 
protocols for producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. Patent 
5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 5,545,806. In 
addition, companies such as Abgenix, Inc. (Fremont, CA), can be engaged to provide 
human antibodies directed against a selected antigen using technology similar to that 
described above. 

Completely human antibodies which recognize a selected epitope can be 
generated using a technique referred to as “guided selection.” In this approach a 
selected non-human monoclonal antibody, e.g., a murine antibody, is used to guide the 
selection of a completely human antibody recognizing the same epitope (Jespers et al., 
1994, Bio/Technology 12:899-903). 

Antibody fragments that contain the idiotypes of the complex can be generated by 
techniques known in the art. For example, such fragments include, but are not limited to, 
the F(ab')2 fragment which can be produced by pepsin digestion of the antibody 
molecule; the Fab' fragment that can be generated by reducing the disulfide bridges of 
the F(ab')2 fragment; the Fab fragment that can be generated by treating the antibody 
molecule with papain and a reducing agent; and Fv fragments. 

In the production of antibodies, screening for the desired antibody can be 
accomplished by techniques known in the art, e.g., ELISA (enzyme-linked 
immunosorbent assay). To select antibodies specific to a particular domain of the 
complex, or a derivative thereof, one may assay generated hybridomas for a product that 
binds to the fragment of the complex, or a derivative thereof, that contains such a 
domain. For selection of an antibody that specifically binds a complex of the present invention, or 
a derivative, or homolog thereof, but which does not specifically bind to the individual 
proteins of the complex, or a derivative, or homolog thereof, one can select on the basis 
of positive binding to the complex and a lack of binding to the individual protein 
components. 
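
The selection rule described above (positive binding to the complex, no binding to the individual components) can be sketched as a filter over screening results. The data layout, the clone names, the signal values and the single cutoff are hypothetical and are used only to make the logic concrete; a real screen would set the cutoff from its own controls.

```python
def is_complex_specific(complex_signal, component_signals, cutoff):
    """True if the antibody binds the complex but none of its components.

    A signal at or above the cutoff counts as binding. The cutoff is an
    assumption of this sketch, not a value taken from the text.
    """
    binds_complex = complex_signal >= cutoff
    binds_any_component = any(s >= cutoff for s in component_signals)
    return binds_complex and not binds_any_component


def select_complex_specific(screen, cutoff):
    """Return the names of clones that pass the selection, sorted."""
    return sorted(
        name
        for name, (complex_signal, component_signals) in screen.items()
        if is_complex_specific(complex_signal, component_signals, cutoff)
    )


# Hypothetical ELISA readings: (signal on complex, signals on each component).
example_screen = {
    "clone_A": (1.8, [0.1, 0.2]),  # complex only
    "clone_B": (1.6, [1.5, 0.1]),  # also binds one component
    "clone_C": (0.1, [0.1, 0.1]),  # binds nothing
}
```

With a cutoff of 0.5, only clone_A in this hypothetical screen would be retained as complex-specific.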

Antibodies specific to a domain of the complex, or a derivative, or homolog 
thereof, are also provided. 

The foregoing antibodies can be used in methods known in the art relating to the 
localization and/or quantification of the complexes of the invention, e.g., for imaging 
these proteins, measuring levels thereof in appropriate physiological samples (by 
immunoassay), in diagnostic methods, etc. This also holds true for a derivative or 
homolog of a complex. 

In another embodiment of the invention (see infra), an antibody to a complex or a 
fragment of such antibodies containing the antibody binding domain, is a Therapeutic. 

5.3. DIAGNOSTIC, PROGNOSTIC, AND SCREENING USES OF PROTEIN 
COMPLEXES 

The particular protein complexes of the present invention may be markers of 
normal physiological processes, and thus have diagnostic utility. Further, definition of 
particular groups of patients with elevations or deficiencies of a protein complex of the 
present invention, or wherein the protein complex has a change in protein component 
composition, can lead to new nosological classifications of diseases, furthering 
diagnostic ability. 

Examples of diseases or disorders in which the complexes provided herein are involved 
and/or with which they are associated are infectious diseases; viral infections such as 
herpes simplex infections, Epstein-Barr infections, and influenza; metabolic diseases 
such as metachromatic leukodystrophy; neurodegenerative disorders such as 
amyotrophic lateral sclerosis; and cancer. 

Detecting levels of protein complexes, or individual component proteins that form 
the complexes, or detecting levels of the mRNAs encoding the components of the 
complex, may be used in diagnosis, prognosis, and/or staging to follow the course of a 
disease state, to follow a therapeutic response, etc. 

A protein complex of the present invention, the individual components of the 
complex, derivatives, analogs or subsequences thereof, encoding nucleic acids (and 
sequences complementary thereto), anti-complex antibodies, and antibodies directed 
against individual components that can form the complex are useful in diagnostics. The 
foregoing molecules can be used in assays, such as immunoassays, to detect, 
prognose, diagnose, or monitor various conditions, diseases, and disorders 
characterized by aberrant levels of a complex or aberrant component composition of a 
complex, or monitor the treatment of such various conditions, diseases, and disorders. 

In particular, such an immunoassay is carried out by a method comprising 
contacting a sample derived from a patient with an anti-complex antibody under 
conditions such that immunospecific binding can occur, and detecting or measuring the 
amount of any immunospecific binding by the antibody. In a specific aspect, such 
binding of antibody, in tissue sections, can be used to detect aberrant complex 
localization, or aberrant (e.g., high, low or absent) levels of a protein complex or 
complexes. In a specific embodiment, an antibody to the complex can be used to assay 
a patient tissue or serum sample for the presence of the complex, where an aberrant 
level of the complex is an indication of a diseased condition. By "aberrant levels" is 
meant increased or decreased levels relative to that present, or a standard level 
representing that present, in an analogous sample from a portion or fluid of the body, or 
from a subject not having the disorder. 
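
As an illustration only, the comparison implied by this definition of "aberrant levels" can be sketched in Python. The function name classify_level, the 1.5-fold tolerance and the example values are hypothetical and are not taken from this specification; any real cutoff would have to be established for the particular assay and sample type.

```python
def classify_level(patient_level, reference_level, fold_tolerance=1.5):
    """Classify a measured complex level against a reference level.

    fold_tolerance is a hypothetical cutoff (an assumption, not part of the
    specification): a level is called aberrant only if it differs from the
    reference by more than this factor in either direction.
    """
    if reference_level <= 0 or fold_tolerance < 1:
        raise ValueError("reference_level must be > 0 and fold_tolerance >= 1")
    if patient_level < 0:
        raise ValueError("patient_level must not be negative")
    if patient_level == 0:
        return "absent"
    ratio = patient_level / reference_level
    if ratio > fold_tolerance:
        return "increased"
    if ratio < 1 / fold_tolerance:
        return "decreased"
    return "normal"


# Example with made-up signal intensities in arbitrary units.
for level in (30.0, 10.0, 2.0, 0.0):
    print(level, classify_level(level, reference_level=10.0))
```

The reference level may come either from an analogous sample of the same patient or from a subject not having the disorder, as stated above; the sketch does not distinguish between the two.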

The immunoassays which can be used include but are not limited to competitive 
and non-competitive assay systems using techniques such as Western blots, 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoprecipitation assays, precipitin reactions, gel diffusion precipitin 
reactions, immunodiffusion assays, agglutination assays, complement-fixation assays, 
immunoradiometric assays, fluorescent immunoassays, protein A immunoassays, to 
name but a few known in the art. 

Nucleic acids encoding the components of the protein complex and related 
nucleic acid sequences and subsequences, including complementary sequences, can be 
used in hybridization assays. The nucleic acid sequences, or subsequences thereof, 
comprising at least about 8 nucleotides, can be used as hybridization probes. 
Hybridization assays can be used to detect, prognose, diagnose, or monitor conditions, 
disorders, or disease states associated with aberrant levels of the mRNAs encoding the 
components of a complex as described, supra. In particular, such a hybridization assay 
is carried out by a method comprising contacting a sample containing nucleic acid with a 
nucleic acid probe capable of hybridizing to component protein coding DNA or RNA, 
under conditions such that hybridization can occur, and detecting or measuring any 
resulting hybridization. 
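
The probe-to-target relationship described above can be sketched in Python as follows. This is a simplified model that assumes perfect Watson-Crick complementarity, whereas real hybridization tolerates mismatches depending on stringency. The function names and the example sequences are invented for illustration and do not correspond to any sequence of this specification.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA or RNA sequence as DNA letters."""
    seq = seq.upper().replace("U", "T")
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def probe_sites(probe, target, min_length=8):
    """Return 0-based start positions in target where probe can anneal.

    Only perfect complementarity is considered (a simplifying assumption).
    min_length reflects the lower bound of about 8 nucleotides named in the text.
    """
    if len(probe) < min_length:
        raise ValueError("probe is shorter than the minimum probe length")
    site = reverse_complement(probe)
    target = target.upper().replace("U", "T")
    return [
        i
        for i in range(len(target) - len(site) + 1)
        if target[i:i + len(site)] == site
    ]


# Hypothetical mRNA fragment and a 10-nucleotide probe complementary to it.
mrna = "GGAUGCCUAAGCUUACGG"
probe = "GCTTAGGCAT"
print(probe_sites(probe, mrna))
```

An empty list means that the probe finds no perfectly complementary site in the sample sequence.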

In specific embodiments, diseases and disorders involving or characterized by 
aberrant levels of a protein complex or aberrant complex composition can be diagnosed, 
or its suspected presence can be screened for, or a predisposition to develop such 
disorders can be detected, by determining the component protein composition of the 
complex, or detecting aberrant levels of a member of the complex or un-complexed 
component proteins or encoding nucleic acids, or functional activity including, but not 
restricted to, binding to an interacting partner, or by detecting mutations in component 
protein RNA, DNA or protein (e.g., mutations such as translocations, truncations, 
changes in nucleotide or amino acid sequence relative to wild-type that cause increased 
or decreased expression or activity of a complex and/or component protein). Such 
diseases and disorders include, but are not limited to, those described in Section 5.4 and 
its subsections. 

By way of example, levels of a protein complex and the individual components of 
a complex can be detected by immunoassay, levels of component protein RNA or DNA 
can be detected by hybridization assays (e.g., Northern blots, dot blots, RNase 
protection assays), and binding of component proteins to each other (e.g., complex 
formation) can be measured by binding assays commonly known in the art. 
Translocations and point mutations in component protein genes can be detected by 
Southern blotting, RFLP analysis, PCR using primers that preferably generate a 
fragment spanning at least most of the gene, by sequencing of genomic DNA or cDNA 
obtained from the patient, etc. 

Assays well known in the art (e.g., assays described above such as 
immunoassays, nucleic acid hybridization assays, activity assays, etc.) can be used to 
determine whether one or more particular protein complexes are present at either 
increased or decreased levels, or are absent, in samples from patients suffering from a 
particular disease or disorder, or having a predisposition to develop such a disease or 
disorder, as compared to the levels in samples from subjects not having such a disease 
or disorder, or not having a predisposition to develop such a disease or disorder. 

Additionally, these assays can be used to determine whether the ratio of the complex to 
the un-complexed components of the complex, is increased or decreased in samples 
from patients suffering from a particular disease or disorder, or having a predisposition to 
develop such a disease or disorder, as compared to the ratio in samples from subjects 
not having such a disease or disorder. In the event that levels of one or more particular 
protein complexes (i.e., complexes formed from component protein derivatives, 
homologs, fragments, or analogs) are determined to be increased in patients suffering 
from a particular disease or disorder, or having a predisposition to develop such a 
disease or disorder, then the particular disease or disorder, or predisposition for a 
disease or disorder, can be diagnosed, have its prognosis defined, be screened for, or 
be monitored by detecting increased levels of the one or more protein complexes, 
increased levels of the mRNA that encodes one or more members of the one or more 
particular protein complexes, or by detecting increased complex functional activity. 
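
The ratio comparison described in this paragraph amounts to simple arithmetic, sketched here in Python. The function names and the example quantities are hypothetical; the sketch assumes that complexed and un-complexed amounts were measured in the same units within each sample.

```python
def complex_to_free_ratio(complexed, uncomplexed):
    """Ratio of complexed to un-complexed component in one sample."""
    if complexed < 0 or uncomplexed <= 0:
        raise ValueError("complexed must be >= 0 and uncomplexed must be > 0")
    return complexed / uncomplexed


def ratio_fold_change(patient, control):
    """Fold change of the complex/un-complexed ratio, patient versus control.

    patient and control are (complexed, uncomplexed) pairs.
    A value above 1 indicates an increased ratio, below 1 a decreased ratio.
    """
    control_ratio = complex_to_free_ratio(*control)
    if control_ratio == 0:
        raise ValueError("control sample contains no complex; fold change undefined")
    return complex_to_free_ratio(*patient) / control_ratio


# Made-up amounts in arbitrary units.
print(ratio_fold_change(patient=(6.0, 2.0), control=(3.0, 2.0)))
```

Whether a given fold change is diagnostically meaningful is not addressed by the sketch and would depend on the assay and the disorder in question.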

Accordingly, in a specific embodiment of the present invention, diseases and 
disorders involving increased levels of one or more protein complexes can be diagnosed, 
or their suspected presence can be screened for, or a predisposition to develop such 
disorders can be detected, by detecting increased levels of the one or more protein 
complexes, the mRNA encoding one or more members of the complex, or complex functional 
activity, or by detecting mutations in the component proteins that stabilize or enhance 
complex formation, e.g., mutations such as translocations in nucleic acids, truncations in 
the gene or protein, changes in nucleotide or amino acid sequence relative to wild-type, 
that stabilize or enhance complex formation. 

In the event that levels of one or more particular protein complexes are 
determined to be decreased in patients suffering from a particular disease or disorder, or 
having a predisposition to develop such a disease or disorder, then the particular disease 
or disorder or predisposition for a disease or disorder can be diagnosed, have its 
prognosis determined, be screened for, or be monitored by detecting decreased levels of 
the one or more protein complexes, the mRNA that encodes one or more members of 
the particular one or more protein complexes, or by detecting decreased protein complex 
functional activity. 

Accordingly, in a specific embodiment of the invention, diseases and disorders 
involving decreased levels of one or more protein complexes can be diagnosed, or their 
suspected presence can be screened for, or a predisposition to develop such disorders 
can be detected, by detecting decreased levels of the one or more protein complexes, 
the mRNA encoding one or more members of the one or more complexes, or complex 
functional activity, or by detecting mutations in the component proteins that decrease 
complex formation, e.g., mutations such as translocations in nucleic acids, truncations in 
the gene or protein, changes in nucleotide or amino acid sequence relative to wild-type, 
that decrease complex formation. 

Accordingly, in a specific embodiment of the invention, diseases and disorders 
involving aberrant compositions of the complexes can be diagnosed, or their suspected 
presence can be screened for, or a predisposition to develop such disorders can be 
detected, by detecting the component proteins of one or more complexes, or the mRNA 
encoding the members of the one or more complexes. 

The use of detection techniques, especially those involving antibodies against a 
protein complex, provides a method of detecting specific cells that express the complex 
or component proteins. Using such assays, specific cell types can be defined in which 
one or more particular protein complexes are expressed, and the presence of the 
complex or component proteins can be correlated with cell viability, state, health, etc. 

Also embodied are methods to detect a protein complex of the present invention 
in cell culture models that express particular protein complexes or derivatives thereof, for 
the purpose of characterizing or preparing the complexes for harvest. This embodiment 
includes cell sorting of prokaryotes such as but not restricted to bacteria (Davey and Kell, 
1996, Microbiol. Rev. 60:641-696), primary cultures and tissue specimens from 
eukaryotes, including mammalian species such as human (Steele et al., 1996, Clin. 
Obstet. Gynecol. 39:801-813), and continuous cell cultures (Orfao and Ruiz-Arguelles, 
1996, Clin. Biochem. 29:5-9). Such isolations can be used as methods of diagnosis, 
described, supra. 

5.4. THERAPEUTIC USES OF PROTEIN COMPLEXES 

The present invention is directed to a method for treatment or prevention of 
various diseases and disorders by administration of a therapeutic compound (termed 
herein "Therapeutic"). Such "Therapeutics" include, but are not limited to, a protein 
complex of the present invention, the individual component proteins, and analogs and 
derivatives (including fragments) of the foregoing (e.g., as described hereinabove); 
antibodies thereto (as described hereinabove); nucleic acids encoding the component 
protein, and analogs or derivatives, thereof (e.g., as described hereinabove); component 
protein antisense nucleic acids, and agents that modulate complex formation and/or 
activity (i.e., agonists and antagonists). 

The protein complexes, as identified herein, are implicated significantly in normal 
physiological processes such as RNA processing and modification. 

Furthermore, the protein complexes as identified herein are involved in 
processes which are implicated in or associated with pathological conditions. 

Diseases and disorders which can be treated and/or prevented and/or 
diagnosed by Therapeutics interacting with any of the complexes provided herein are, for 
example, infectious diseases; viral infections such as herpes simplex infections, 
Epstein-Barr infections, and influenza; metabolic diseases such as metachromatic 
leukodystrophy; neurodegenerative disorders such as amyotrophic lateral sclerosis; and cancer. 

These disorders are treated or prevented by administration of a 
Therapeutic that modulates (i.e., inhibits or promotes) protein complex activity or 
formation. Diseases or disorders associated with aberrant levels of complex activity or 
formation, or aberrant levels or activity of the component proteins, or aberrant complex 
composition, may be treated by administration of a Therapeutic that modulates complex 
formation or activity or by the administration of a protein complex. 

A Therapeutic may also be administered to modulate complex formation, activity, or 
level in a microbial organism, such as a yeast or other fungus (e.g., Candida 
albicans), that causes an infectious disease in animals or humans. 

Diseases and disorders characterized by increased (relative to a subject not 
suffering from the disease or disorder) complex levels or activity can be treated with 
Therapeutics that antagonize (i.e., reduce or inhibit) complex formation or activity. 
Therapeutics that can be used include, but are not limited to, the component proteins or 
an analog, derivative or fragment of the component protein; anti-complex antibodies 
(e.g., antibodies specific for the protein complex, or a fragment or derivative of the 
antibody containing the binding region thereof); nucleic acids encoding the component 
proteins; antisense nucleic acids complementary to nucleic acids encoding the 
component proteins; and nucleic acids encoding the component protein that are 
dysfunctional due to, e.g., a heterologous insertion within the protein coding sequence, 
that are used to "knockout" endogenous protein function by homologous recombination, 
see, e.g., Capecchi, 1989, Science 244:1288-1292. In one embodiment, a Therapeutic 
is 1, 2 or more antisense nucleic acids which are complementary to 1, 2, or more nucleic 
acids, respectively, that encode component proteins of a complex. 

In a specific embodiment of the present invention, a nucleic acid containing a 
portion of a component protein gene in which gene sequences flank (are both 5’ and 3’ 
to) a different gene sequence, is used as a component protein antagonist, or to promote 
component protein inactivation by homologous recombination (see also, Koller and 
Smithies, 1989, Proc. Natl. Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 
342:435-438). Additionally, mutants or derivatives of a component protein that have 
greater affinity for another component protein or the complex than wild type may be 
administered to compete with wild type protein for binding, thereby reducing the levels of 
complexes containing the wild type protein. Other Therapeutics that inhibit complex 
function can be identified by use of known convenient in vitro assays, e.g., based on their 
ability to inhibit complex formation, or as described in Section 5.5, infra. 

In specific embodiments, Therapeutics that antagonize complex formation or 
activity are administered therapeutically, including prophylactically, (1) in diseases or 
disorders involving an increased (relative to normal or desired) level of a complex, for 
example, in patients where complexes are overactive or overexpressed; or (2) in 
diseases or disorders where an in vitro (or in vivo) assay (see infra) indicates the utility of 
antagonist administration. Increased levels of a complex can be readily detected, e.g., 
by quantifying protein and/or RNA, by obtaining a patient tissue sample (e.g., from 
biopsy tissue) and assaying it in vitro for RNA or protein levels, or structure and/or 
activity of the expressed complex (or the encoding mRNA). Many methods standard in 
the art can be thus employed including, but not limited to, immunoassays to detect 
complexes and/or visualize complexes (e.g., Western blot analysis, immunoprecipitation 
followed by sodium dodecyl sulfate polyacrylamide gel electrophoresis [SDS-PAGE], 
immunocytochemistry, etc.), and/or hybridization assays to detect concurrent expression 
of component protein mRNA (e.g., Northern assays, dot blot analysis, in situ 
hybridization, etc.). 

A more specific embodiment of the present invention is directed to a method of 
reducing complex expression (i.e., expression of the protein components of the complex 
and/or formation of the complex) by targeting mRNAs that express the protein moieties. 
RNA therapeutics currently fall within three classes: antisense species, ribozymes, and 
RNA aptamers (Good et al., 1997, Gene Therapy 4:45-54). 

Antisense oligonucleotides have been the most widely used. By way of example, 
but not limitation, antisense oligonucleotide methodology to reduce complex formation is 
presented below, infra. Ribozyme therapy involves the administration, induced 
expression, etc. of small RNA molecules with enzymatic ability to cleave, bind, or 
otherwise inactivate specific RNAs, to reduce or eliminate expression of particular 
proteins (Grassi and Marini, 1996, Annals of Medicine 28:499-510; Gibson, 1996, Cancer 
and Metastasis Reviews 15:287-299). RNA aptamers are specific RNA ligands for proteins, 
such as for Tat and Rev RNA (Good et al., 1997, Gene Therapy 4:45-54) that can 
specifically inhibit their translation. Aptamers specific for component proteins can be 
identified by many methods well known in the art, for example, by affecting the formation 
of a complex in the protein-protein interaction assay described, infra. 

In another embodiment, the activity or levels of a component protein are reduced 
by administration of another component protein, or the encoding nucleic acid, or an 
antibody that immunospecifically binds to the component protein, or a fragment or a 
derivative of the antibody containing the binding domain thereof. 

In another aspect of the invention, diseases or disorders associated with 
increased levels of a component protein of the complex may be treated or prevented by 
administration of a Therapeutic that increases complex formation if the complex 
formation acts to reduce or inactivate the component protein through complex formation. 
Such diseases or disorders can be treated or prevented by administration of one 
component member of the complex, administration of antibodies or other molecules that 
stabilize the complex, etc. 

Diseases and disorders associated with underexpression of a complex, or a 
component protein, are treated or prevented by administration of a Therapeutic that 
promotes (i.e., increases or supplies) complex levels and/or function, or individual 
component protein function. Examples of such a Therapeutic include but are not limited 
to a complex or a derivative, analog or fragment of the complex that is functionally 
active (e.g., able to form a complex), un-complexed component proteins and derivatives, 
analogs, and fragments of un-complexed component proteins, and nucleic acids 
encoding the members of a complex or functionally active derivatives or fragments of the 
members of the complex, e.g., for use in gene therapy. In a specific embodiment, a 
Therapeutic includes derivatives, homologs or fragments of a component protein that 
increase and/or stabilize complex formation. Examples of other agonists can be 
identified using in vitro assays or animal models, examples of which are described, infra. 

In yet other specific embodiments of the present invention, Therapeutics that 
promote complex function are administered therapeutically, including prophylactically, (1) 
in diseases or disorders involving an absence or decreased (relative to normal or 
desired) level of a complex, for example, in patients where a complex, or the individual 
components necessary to form the complex, is lacking, genetically defective, biologically 
inactive or underactive, or under-expressed; or (2) in diseases or disorders wherein an in 
vitro or in vivo assay (see, infra) indicates the utility of complex agonist administration. 
The absence or decreased level of a complex, component protein or function can be 
readily detected, e.g., by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or protein levels, structure and/or activity of the expressed 
complex and/or the concurrent expression of mRNA encoding the components of the 
complex. Many methods standard in the art can be thus employed, including but not 
limited to immunoassays to detect and/or visualize a complex, or the individual 
components of a complex (e.g., Western blot analysis, immunoprecipitation followed by 
sodium dodecyl sulfate polyacrylamide gel electrophoresis [SDS-PAGE], 
immunocytochemistry, etc.) and/or hybridization assays to detect expression of mRNAs 
encoding the individual protein components of a complex by detecting and/or visualizing 
component mRNA concurrently or separately using, e.g., Northern assays, dot blot 
analysis, in situ hybridization, etc. 

In specific embodiments, the activity or levels of a component protein are 
increased by administration of another component protein of the same complex, or a 
derivative, homolog or analog thereof, a nucleic acid encoding the other component, or 
an agent that stabilizes or enhances the other component, or a fragment or derivative of 
such an agent. 

Generally, administration of products of species origin or species reactivity (in the 
case of antibodies) that is the same species as that of the patient is preferred. Thus, in a 
preferred embodiment, a human complex, or derivative, homolog or analog thereof; 
nucleic acids encoding the members of the human complex or a derivative, homolog or 
analog thereof; an antibody to a human complex, or a derivative thereof; or other human 
agents that affect component proteins or the complex, are therapeutically or 
prophylactically administered to a human patient. 

Preferably, suitable in vitro or in vivo assays are utilized to determine the effect of 
a specific Therapeutic and whether its administration is indicated for treatment of the 
affected tissue or individual. 

In various specific embodiments, in vitro assays can be carried out with 
representative cells of cell types involved in a patient's disorder, to determine if a 
Therapeutic has a desired effect upon such cell types. 

Compounds for use in therapy can be tested in suitable animal model systems 
prior to testing in humans, including, but not limited to, rats, mice, chickens, cows, 
monkeys, rabbits, etc. For in vivo testing, prior to administration to humans, any animal 
model system known in the art may be used. Additional descriptions and sources of 
Therapeutics that can be used according to the invention are found in Sections 5.1 to 5.3 
and 5.7 herein. 

5.4.1. GENE THERAPY 

In a specific embodiment of the present invention, nucleic acids comprising a 
sequence encoding the component proteins, or a functional derivative thereof, are 
administered to modulate complex activity or formation by way of gene therapy. Gene 
therapy refers to therapy performed by the administration of a nucleic acid to a subject. 

In this embodiment of the present invention, the nucleic acid expresses its encoded 
protein(s) that mediates a therapeutic effect by modulating complex activity or formation. 
Any of the methods for gene therapy available in the art can be used according to the 
present invention. Exemplary methods are described below. 

For general reviews of the methods of gene therapy, see Goldspiel et al., 1993, 
Clinical Pharmacy 12:488-505; Wu and Wu, 1991, Biotherapy 3:87-95; Tolstoshev, 1993, 
Ann. Rev. Pharmacol. Toxicol. 32:573-596; Mulligan, 1993, Science 260:926-932; 
Morgan and Anderson, 1993, Ann. Rev. Biochem. 62:191-217; and May, 1993, TIBTECH 
11:155-215. Methods commonly known in the art of recombinant DNA technology which 
can be used are described in Ausubel et al., eds., 1993, Current Protocols in Molecular 
Biology, John Wiley & Sons, NY; and Kriegler, 1990, Gene Transfer and Expression, A 
Laboratory Manual, Stockton Press, NY. 

In a preferred aspect, the Therapeutic comprises a nucleic acid that is part of an 
expression vector that expresses one or more of the component proteins, or fragments 
or chimeric proteins thereof, in a suitable host. In particular, such a nucleic acid has a 
promoter operably linked to the protein coding region(s) (or, less preferably, separate 
promoters linked to the separate coding regions separately), said promoter being 
inducible or constitutive, and optionally, tissue-specific. In another particular 
embodiment, a nucleic acid molecule is used in which the coding sequences, and any 
other desired sequences, are flanked by regions that promote homologous 
recombination at a desired site in the genome, thus providing for intra-chromosomal 
expression of the component protein nucleic acids (Koller and Smithies, 1989, Proc. Natl. 
Acad. Sci. USA 86:8932-8935; Zijlstra et al., 1989, Nature 342:435-438). 

Delivery of the nucleic acid into a patient may be either direct, in which case the 
patient is directly exposed to the nucleic acid or nucleic acid-carrying vector, or indirect, 
in which case, cells are first transformed with the nucleic acid in vitro, then transplanted 
into the patient. These two approaches are known, respectively, as in vivo or ex vivo 
gene therapy. 

In a specific embodiment, the nucleic acid is directly administered in vivo, where it 
is expressed to produce the encoded product. This can be accomplished by any of 
numerous methods known in the art, e.g., by constructing it as part of an appropriate 
nucleic acid expression vector and administering it so that it becomes intracellular, e.g., 
by infection using a defective or attenuated retroviral or other viral vector (U.S. Patent 
No. 4,980,286), or by direct injection of naked DNA, or by use of microparticle 
bombardment (e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface 
receptors, or through use of transfecting agents, by encapsulation in liposomes, 
microparticles, or microcapsules, or by administering it in linkage to a peptide that is 
known to enter the nucleus, or by administering it in linkage to a ligand subject to 
receptor-mediated endocytosis that can be used to target cell types specifically 
expressing the receptors (e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432), etc. 

In another embodiment, a nucleic acid-ligand complex can be formed in which the ligand 
comprises a fusogenic viral peptide that disrupts endosomes, allowing the nucleic acid to 
avoid lysosomal degradation. In yet another embodiment, the nucleic acid can be 
targeted in vivo for cell specific uptake and expression, by targeting a specific receptor 
(see, e.g., International Patent Publications WO 92/06180; WO 92/22635; WO 92/20316; 
WO 93/14188; and WO 93/20221). Alternatively, the nucleic acid can be introduced 
intracellularly and incorporated within host cell DNA for expression, by homologous 
recombination (Koller and Smithies, 1989, Proc. Natl. Acad. Sci. USA 86:8932-8935; 
Zijlstra et al., 1989, Nature 342:435-438). 

In a specific embodiment, a viral vector that contains the component protein 
encoding nucleic acids is used. For example, a retroviral vector can be used (Miller et 
al., 1993, Meth. Enzymol. 217:581-599). These retroviral vectors have been modified to 
delete retroviral sequences that are not necessary for packaging of the viral genome and 
integration into host cell DNA. The encoding nucleic acids to be used in gene therapy 
is/are cloned into the vector, which facilitates delivery of the gene into a patient. More 
detail about retroviral vectors can be found in Boesen et al., 1994, Biotherapy 6:291-302, 
which describes the use of a retroviral vector to deliver the mdr1 gene to hematopoietic 
stem cells in order to make the stem cells more resistant to chemotherapy. Other 
references illustrating the use of retroviral vectors in gene therapy are Clowes et al., 
1994, J. Clin. Invest. 93:644-651; Kiem et al., 1994, Blood 83:1467-1473; Salmons and 
Gunzberg, 1993, Human Gene Therapy 4:129-141; and Grossman and Wilson, 1993, 
Curr. Opin. in Genetics and Devel. 3:110-114. 

Adenoviruses are other viral vectors that can be used in gene therapy. 
Adenoviruses are especially attractive vehicles for delivering genes to respiratory 
epithelia. Adenoviruses naturally infect respiratory epithelia where they cause a mild 
disease. Other targets for adenovirus-based delivery systems are the liver, the central 
nervous system, endothelial cells and muscle. Adenoviruses have the advantage of 
being capable of infecting non-dividing cells. Kozarsky and Wilson, 1993, Current 
Opinion in Genetics and Development 3:499-503, discuss adenovirus-based gene 
therapy. The use of adenovirus vectors to transfer genes to the respiratory epithelia of 
rhesus monkeys has been demonstrated by Bout et al., 1994, Human Gene Therapy 5:3- 
10. Other instances of the use of adenoviruses in gene therapy can be found in 
Rosenfeld et al., 1991, Science 252:431-434; Rosenfeld et al., 1992, Cell 68:143-155; 
and Mastrangeli et al., 1993, J. Clin. Invest. 91:225-234. 

Adeno-associated virus (AAV) has also been proposed for use in gene therapy 
(Walsh et al., 1993, Proc. Soc. Exp. Biol. Med. 204:289-300). 

Another approach to gene therapy involves transferring a gene into cells in tissue 
culture by methods such as electroporation, lipofection, calcium phosphate-mediated 
transfection, or viral infection. Usually, the method of transfer includes the transfer of a 
selectable marker to the cells. The cells are then placed under selection to isolate those 
cells that have taken up and are expressing the transferred gene from those that have 
not. Those cells are then delivered to a patient. 

In this embodiment, the nucleic acid is introduced into a cell prior to administration 
in vivo of the resulting recombinant cell. Such introduction can be carried out by any 
method known in the art including, but not limited to, transfection by electroporation, 
microinjection, infection with a viral or bacteriophage vector containing the nucleic acid 
sequences, cell fusion, chromosome-mediated gene transfer, microcell-mediated gene 
transfer, spheroplast fusion, etc. Numerous techniques are known in the art for the 
introduction of foreign genes into cells (see, e.g., Loeffler and Behr, 1993, Meth. 
Enzymol. 217:599-618; Cohen et al., 1993, Meth. Enzymol. 217:618-644; Cline, 1985, 
Pharmac. Ther. 29:69-92) and may be used in accordance with the present invention, 
provided that the necessary developmental and physiological functions of the recipient 
cells are not disrupted. The technique should provide for the stable transfer of the 
nucleic acid to the cell, so that the nucleic acid is expressible by the cell and preferably, 
is heritable and expressible by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by various methods 
known in the art. In a preferred embodiment, epithelial cells are injected, e.g., 
subcutaneously. In another embodiment, recombinant skin cells may be applied as a 
skin graft onto the patient. Recombinant blood cells (e.g., hematopoietic stem or 
progenitor cells) are preferably administered intravenously. The amount of cells 
envisioned for use depends on the desired effect, patient state, etc., and can be 
determined by one skilled in the art. 

Cells into which a nucleic acid can be introduced for purposes of gene therapy 
encompass any desired, available cell type, and include but are not limited to epithelial 
cells, endothelial cells, keratinocytes, fibroblasts, muscle cells, hepatocytes, blood cells 
such as T lymphocytes, B lymphocytes, monocytes, macrophages, neutrophils, 
eosinophils, megakaryocytes, and granulocytes, various stem or progenitor cells, in 
particular hematopoietic stem or progenitor cells, e.g., as obtained from bone marrow, 
umbilical cord blood, peripheral blood, fetal liver, etc. 

In a preferred embodiment, the cell used for gene therapy is autologous to the 
patient. 

In an embodiment in which recombinant cells are used in gene therapy, a 
component protein encoding nucleic acid is/are introduced into the cells such that the 
gene or genes are expressible by the cells or their progeny, and the recombinant cells 
are then administered in vivo for therapeutic effect. In a specific embodiment, stem or 
progenitor cells are used. Any stem and/or progenitor cells which can be isolated and 
maintained in vitro can potentially be used in accordance with this embodiment of the 
present invention. Such stem cells include but are not limited to hematopoietic stem cells 
(HSCs), stem cells of epithelial tissues such as the skin and the lining of the gut, 
embryonic heart muscle cells, liver stem cells (International Patent Publication WO 
94/08598), and neural stem cells (Stemple and Anderson, 1992, Cell 71:973-985). 

Epithelial stem cells (ESCs), or keratinocytes, can be obtained from tissues such 
as the skin and the lining of the gut by known procedures (Rheinwald, 1980, Meth. Cell 
Biol. 2A:229). In stratified epithelial tissue such as the skin, renewal occurs by mitosis of 
stem cells within the germinal layer, the layer closest to the basal lamina. Similarly, stem 
cells within the lining of the gut provide for a rapid renewal rate of this tissue. ESCs or 
keratinocytes obtained from the skin or lining of the gut of a patient or donor can be 
grown in tissue culture (Rheinwald, 1980, Meth. Cell Biol. 2A:229; Pittelkow and Scott, 
1986, Mayo Clinic Proc. 61:771). If the ESCs are provided by a donor, a method for 
suppression of host versus graft reactivity (e.g., irradiation, or drug or antibody 
administration to promote moderate immunosuppression) can also be used. 

With respect to hematopoietic stem cells (HSCs), any technique that provides for 
the isolation, propagation, and maintenance in vitro of HSCs can be used in this 
embodiment of the invention. Techniques by which this may be accomplished include 
(a) the isolation and establishment of HSC cultures from bone marrow cells isolated from 
the future host, or a donor, or (b) the use of previously established long-term HSC 
cultures, which may be allogeneic or xenogeneic. Non-autologous HSCs are used 
preferably in conjunction with a method of suppressing transplantation immune reactions 
between the future host and patient. In a particular embodiment of the present invention, 
human bone marrow cells can be obtained from the posterior iliac crest by needle 
aspiration (see, e.g., Kodo et al., 1984, J. Clin. Invest. 73: 1377-1384). In a preferred 
embodiment of the present invention, the HSCs can be made highly enriched or in 
substantially pure form. This enrichment can be accomplished before, during, or after 
long-term culturing, and can be done by any technique known in the art. Long-term 
cultures of bone marrow cells can be established and maintained by using, for example, 
modified Dexter cell culture techniques (Dexter et al., 1977, J. Cell Physiol. 91:335) or 
Whitlock-Witte culture techniques (Whitlock and Witte, 1982, Proc. Natl. Acad. Sci. USA 
79:3608-3612). 

In a specific embodiment, the nucleic acid to be introduced for purposes of gene 
therapy comprises an inducible promoter operably linked to the coding region, such that 
expression of the nucleic acid is controllable by controlling the presence or absence of 
the appropriate inducer of transcription. 

Additional methods can be adapted for use to deliver a nucleic acid encoding the 
component proteins, or functional derivatives thereof, e.g., as described in Section 5.1, 
supra. 

5.4.2. USE OF ANTISENSE OLIGONUCLEOTIDES FOR SUPPRESSION OF 
PROTEIN COMPLEX ACTIVITY OR FORMATION 

In a specific embodiment of the present invention, protein complex activity and 
formation are inhibited by use of antisense nucleic acids against the component proteins 
of the complex, which inhibit transcription and/or translation of their complementary sequence. 
The present invention provides the therapeutic or prophylactic use of nucleic acids of at 
least six nucleotides that are antisense to a gene or cDNA encoding a component 
protein, or a portion thereof. An "antisense" nucleic acid as used herein refers to a 
nucleic acid capable of hybridizing to a sequence-specific portion of a component protein 
RNA (preferably mRNA) by virtue of some sequence complementarity. The antisense 
nucleic acid may be complementary to a coding and/or noncoding region of a component 
protein mRNA. Such antisense nucleic acids that inhibit complex formation or activity 
have utility as Therapeutics, and can be used in the treatment or prevention of disorders 
as described supra. 

The antisense nucleic acids of the invention can be oligonucleotides that are 
double-stranded or single-stranded, RNA or DNA, or a modification or derivative thereof, 
which can be directly administered to a cell, or which can be produced intracellularly by 
transcription of exogenous, introduced sequences. 

In another embodiment, the present invention is directed to a method for inhibiting 
the expression of component protein nucleic acid sequences, in a prokaryotic or 
eukaryotic cell, comprising providing the cell with an effective amount of a composition 
comprising an antisense nucleic acid of the component protein, or a derivative thereof, of 
the invention. 

The antisense nucleic acids are of at least six nucleotides and are preferably 
oligonucleotides, ranging from 6 to about 200 nucleotides. In specific aspects, the 
oligonucleotide is at least 10 nucleotides, at least 15 nucleotides, at least 100 
nucleotides, or at least 200 nucleotides. The oligonucleotides can be DNA or RNA or 
chimeric mixtures, or derivatives or modified versions thereof, and either single-stranded 
or double-stranded. The oligonucleotide can be modified at the base moiety, sugar 
moiety, or phosphate backbone. The oligonucleotide may include other appending 
groups such as peptides, agents facilitating transport across the cell membrane (see, 
e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 
1987, Proc. Natl. Acad. Sci. 84:648-652; International Patent Publication No. 
WO 88/09810) or the blood-brain barrier (see, e.g., International Patent Publication No. 
WO 89/10134), hybridization-triggered cleavage agents (see, e.g., Krol et al., 1988, 
BioTechniques 6:958-976), or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 
5:539-549). 

In a preferred aspect of the invention, an antisense oligonucleotide is provided, 
preferably as single-stranded DNA. The oligonucleotide may be modified at any position 
in its structure with constituents generally known in the art. 

The antisense oligonucleotides may comprise at least one modified base moiety 
which is selected from the group including but not limited to 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 
5-(carboxyhydroxylmethyl)uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5N-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 
3-(3-amino-3-N-2-carboxypropyl)uracil, (acp3)w, and 2,6-diaminopurine. 

In another embodiment, the oligonucleotide comprises at least one modified sugar 
moiety selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, 
xylulose, and hexose. 

In yet another embodiment, the oligonucleotide comprises at least one modified 
phosphate backbone selected from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphorodiamidate, 
a methylphosphonate, an alkyl phosphotriester, and a formacetal, or an analog of the 
foregoing. 

In yet another embodiment, the oligonucleotide is an α-anomeric oligonucleotide. 
An α-anomeric oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual β-units, the strands run parallel to 
each other (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). 

The oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
hybridization-triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

Oligonucleotides of the invention may be synthesized by standard methods 
known in the art, e.g., by use of an automated DNA synthesizer (such as are 
commercially available from Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides may be synthesized by the method of Stein et al. 
(1988, Nucl. Acids Res. 16:3209), methylphosphonate oligonucleotides can be prepared 
by use of controlled pore glass polymer supports (Sarin et al., 1988, Proc. Natl. Acad. 
Sci. U.S.A. 85:7448-7451), etc. 

In a specific embodiment, the antisense oligonucleotides comprise catalytic 
RNAs, or ribozymes (see, e.g., International Patent Publication No. WO 90/11364; 
Sarver et al., 1990, Science 247:1222-1225). In another embodiment, the 
oligonucleotide is a 2’-O-methylribonucleotide (Inoue et al., 1987, Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analog (Inoue et al., 1987, FEBS Lett. 215:327- 
330). 

In an alternative embodiment, the antisense nucleic acids of the invention are 
produced intracellularly by transcription from an exogenous sequence. For example, a 
vector can be introduced in vivo such that it is taken up by a cell, within which cell the 
vector or a portion thereof is transcribed, producing an antisense nucleic acid (RNA) of 
the invention. Such a vector would contain a sequence encoding the component protein. 
Such a vector can remain episomal or become chromosomally integrated, as long as it 
can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can 
be plasmid, viral, or others known in the art to be capable of replication and expression in 
mammalian cells. Expression of the sequences encoding the antisense RNAs can be by 
any promoter known in the art to act in mammalian, preferably human, cells. Such 
promoters can be inducible or constitutive. Such promoters include, but are not limited 
to, the SV40 early promoter region (Benoist and Chambon, 1981, Nature 290:304-310), 
the promoter contained in the 3’ long terminal repeat of Rous sarcoma virus (Yamamoto 
et al., 1980, Cell 22:787-797), the herpes thymidine kinase promoter (Wagner et al., 
1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982, Nature 296:39-42), etc. 

The antisense nucleic acids of the invention comprise a sequence complementary 
to at least a portion of an RNA transcript of a component protein gene, preferably a 
human gene. However, absolute complementarity, although preferred, is not required. A 
sequence "complementary to at least a portion of an RNA," as referred to herein, means 
a sequence having sufficient complementarity to be able to hybridize with the RNA, 
forming a stable duplex; in the case of double-stranded antisense nucleic acids, a single 
strand of the duplex DNA may thus be tested, or triplex formation may be assayed. The 
ability to hybridize will depend on both the degree of complementarity and the length of 
the antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, the more 
base mismatches with a component protein RNA it may contain and still form a stable 
duplex (or triplex, as the case may be). One skilled in the art can ascertain a tolerable 
degree of mismatch by use of standard procedures to determine the melting point of the 
hybridized complex. 
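
The relationship between complementarity, length, and duplex stability described above can be illustrated with a short Python sketch. It derives the fully complementary antisense RNA for a target segment, counts the mismatches in a candidate antisense strand, and gives a rough melting-point estimate. The function names, the example sequence, and the use of the Wallace rule (2 degrees C per A/T(U) residue and 4 degrees C per G/C residue, a rule of thumb for short oligonucleotides) are assumptions made for this illustration only; they are not part of the disclosed method and do not replace an experimental determination of the melting point.

```python
# Illustrative sketch only: names and the example sequence are hypothetical.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_rna(target_mrna: str) -> str:
    """Return the RNA strand (5'->3') fully complementary to target_mrna (5'->3')."""
    return "".join(COMPLEMENT[base] for base in reversed(target_mrna.upper()))


def count_mismatches(antisense: str, target_mrna: str) -> int:
    """Count positions where a candidate antisense strand fails to pair with the target.

    Both strands are assumed to have the same length and to be fully aligned.
    """
    if len(antisense) != len(target_mrna):
        raise ValueError("antisense and target must have the same length")
    perfect = antisense_rna(target_mrna)
    return sum(a != p for a, p in zip(antisense.upper(), perfect))


def wallace_tm(oligo: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule (assumption:
    short oligonucleotide, perfectly matched duplex)."""
    s = oligo.upper()
    at = s.count("A") + s.count("T") + s.count("U")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


target = "AUGGCUAGC"                 # hypothetical target segment, 5'->3'
oligo = antisense_rna(target)        # "GCUAGCCAU"
print(oligo, count_mismatches(oligo, target), wallace_tm(oligo))
```

In practice a longer oligonucleotide tolerates more mismatches before the duplex becomes unstable, which is why the estimate is best treated as a first screen ahead of the standard melting-point procedures mentioned above.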

The component protein antisense nucleic acids can be used to treat (or prevent) 
disorders of a cell type that expresses, or preferably overexpresses, a protein complex. 

Cell types that express or overexpress component protein RNA can be identified 
by various methods known in the art. Such methods include, but are not limited to, 
hybridization with component protein-specific nucleic acids (e.g., by Northern blot 
hybridization, dot blot hybridization, or in situ hybridization), observing the ability of 
RNA from the cell type to be translated in vitro into the component protein, 
immunohistochemistry, Western blot analysis, ELISA, etc. In a preferred aspect, primary 
tissue from a patient can be assayed for protein expression prior to treatment, e.g., by 
immunocytochemistry, in situ hybridization, or any number of methods to detect protein 
or mRNA expression. 

Pharmaceutical compositions of the invention (see Section 5.7, infra), comprising 
an effective amount of a protein component antisense nucleic acid in a pharmaceutically 
acceptable carrier can be administered to a patient having a disease or disorder that is of 
a type that expresses or overexpresses a protein complex of the present invention. 

The amount of antisense nucleic acid that will be effective in the treatment of a 
particular disorder or condition will depend on the nature of the disorder or condition, and 
can be determined by standard clinical techniques. Where possible, it is desirable to 
determine the antisense cytotoxicity in vitro, and then in useful animal model systems, 
prior to testing and use in humans. 

In a specific embodiment, pharmaceutical compositions comprising antisense 
nucleic acids are administered via liposomes, microparticles, or microcapsules. In 
various embodiments of the invention, it may be useful to use such compositions to 
achieve sustained release of the antisense nucleic acids. In a specific embodiment, it 
may be desirable to utilize liposomes targeted via antibodies to specific identifiable 
central nervous system cell types (Leonetti et al., 1990, Proc. Natl. Acad. Sci. U.S.A. 
87:2448-2451; Renneisen et al., 1990, J. Biol. Chem. 265:16337-16342). 

5.5. ASSAYS OF PROTEIN COMPLEXES AND DERIVATIVES AND ANALOGS 
THEREOF 

The functional activity of a protein complex of the present invention, or a 
derivative, fragment or analog thereof, can be assayed by various methods. Potential 
modulators (e.g., agonists and antagonists) of complex activity or formation, e.g., 
anti-complex antibodies and antisense nucleic acids, can be assayed for the ability to 
modulate complex activity or formation. 

In one embodiment of the present invention, where one is assaying for the ability 
to bind or compete with a wild-type complex for binding to an anti-complex antibody, 
various immunoassays known in the art can be used, including but not limited to 
competitive and non-competitive assay systems using techniques such as 
radioimmunoassay, ELISA (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or 
radioisotope labels), western blot analysis, precipitation reactions, agglutination assays 
(e.g., gel agglutination assays, hemagglutination assays), complement fixation assays, 
immunofluorescence assays, protein A assays, immunoelectrophoresis assays, etc. In 
one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting binding 
of a secondary antibody or reagent to the primary antibody. In a further embodiment, the 
secondary antibody is labeled. Many means are known in the art for detecting binding in 
an immunoassay and are within the scope of the present invention. 

The expression of the component protein genes (both endogenous and those 
expressed from cloned DNA containing the genes) can be detected using techniques 
known in the art, including but not limited to Southern hybridization (Southern, 1975, J. 
Mol. Biol. 98:503-517), northern hybridization (see, e.g., Freeman et al., 1983, Proc. Natl. 
Acad. Sci. USA 80:4094-4098), restriction endonuclease mapping (Sambrook et al., 
1989, Molecular Cloning, A Laboratory Manual, 2nd Ed. Cold Spring Harbor Laboratory 
Press, New York), RNase protection assays (Current Protocols in Molecular Biology, 
John Wiley and Sons, New York, 1997), DNA sequence analysis, and polymerase chain 
reaction amplification (PCR; U.S. Patent Nos. 4,683,202, 4,683,195, and 4,889,818; 
Gyllenstein et al., 1988, Proc. Natl. Acad. Sci. USA 85:7652-7657; Ochman et al., 1988, 
Genetics 120:621-623; Loh et al., 1989, Science 243:217-220) followed by Southern 
hybridization with probes specific for the component protein genes, in various cell types. 
Methods of amplification other than PCR commonly known in the art can be employed. 

In one embodiment, Southern hybridization can be used to detect genetic linkage of 
component protein gene mutations to physiological or pathological states. Various cell 
types, at various stages of development, can be characterized for their expression of 
component proteins at the same time and in the same cells. The stringency of the 
hybridization conditions for northern or Southern blot analysis can be manipulated to 
ensure detection of nucleic acids with the desired degree of relatedness to the specific 
probes used. Modifications to these methods and other methods commonly known in 
the art can be used. 

Derivatives (e.g., fragments), homologs and analogs of one component protein 
can be assayed for binding to another component protein in the same complex by any 
method known in the art, for example the modified yeast matrix mating test described in 
Section 5.6.1 infra, immunoprecipitation with an antibody that binds to the component 
protein complexed with other component proteins in the same complex, followed by size 
fractionation of the immunoprecipitated proteins (e.g., by denaturing or nondenaturing 
polyacrylamide gel electrophoresis), Western blot analysis, etc. 

One embodiment of the invention provides a method for screening a derivative, 
homolog or analog of a component protein for biological activity comprising contacting 
said derivative, homolog or analog of the component protein with the other component 
proteins in the same complex; and detecting the formation of a complex between said 
derivative, homolog or analog of the component protein and the other component 
proteins; wherein detecting formation of said complex indicates that said derivative, 
homolog or analog has biological (e.g., binding) activity. 

The invention also provides methods of modulating the activity of a component 
protein that can participate in a protein complex by administration of a binding partner of 
that protein or derivative, homolog or analog thereof. 

In a specific embodiment of the present invention, a protein complex of the 
present invention is administered to treat or prevent a disease or disorder, since the 
complex and/or component proteins have been implicated in the disease or disorder. 
Accordingly, a protein complex or a derivative, homolog, analog or fragment thereof, 
nucleic acids encoding the component proteins, anti-complex antibodies, and other 
modulators of protein complex activity, can be tested for activity in treating or preventing 
a disease or disorder in in vitro and in vivo assays. 

In one embodiment, a Therapeutic of the invention can be assayed for activity in 
treating or preventing a disease by contacting cultured cells that exhibit an indicator of 
the disease in vitro, with the Therapeutic, and comparing the level of said indicator in the 
cells contacted with the Therapeutic, with said level of said indicator in cells not so 
contacted, wherein a lower level in said contacted cells indicates that the Therapeutic 
has activity in treating or preventing the disease. 

In another embodiment of the invention, a Therapeutic of the invention can be 
assayed for activity in treating or preventing a disease by administering the Therapeutic 
to a test animal that is predisposed to develop symptoms of a disease, and measuring 
the change in said symptoms of the disease after administration of said Therapeutic, 
wherein a reduction in the severity of the symptoms of the disease or prevention of the 
symptoms of the disease indicates that the Therapeutic has activity in treating or 
preventing the disease. Such a test animal can be any one of a number of animal 
models of disease that are well known in the art. These animal models include, but are 
not limited to, those listed in Section 5.6 (supra) as exemplary animal models for 
studying any of the complexes provided in the invention. 

5.6. SCREENING FOR MODULATORS OF THE PROTEIN COMPLEXES 

A complex of the present invention, the component proteins of the complex and 
nucleic acids encoding the component proteins, as well as derivatives and fragments of 
the amino and nucleic acids, can be used to screen for compounds that bind to, or 
modulate the amount of, activity of, or protein component composition of, said complex, 
and thus, have potential use as modulators, i.e., agonists or antagonists, of complex 
activity, and/or complex formation, i.e., the amount of complex formed, and/or protein 
component composition of the complex. 

Thus, the present invention is also directed to methods for screening for 
molecules that bind to, or modulate the amount of, activity of, or protein component 
composition of, a complex of the present invention. In one embodiment of the invention, 
the method for screening for a molecule that modulates directly or indirectly the function, 
activity or formation of a complex of the present invention comprises exposing said 
complex, or a cell or organism containing the complex machinery, to one or more 
candidate molecules under conditions conducive to modulation; and determining the 
amount of, activity of, or identities of the protein components of, said complex, wherein a 
change in said amount, activity, or identities relative to said amount, activity or identities 
in the absence of said candidate molecules indicates that the molecules modulate 
function, activity or formation of said complex. 

In another embodiment, the present invention further relates to a process for the 
identification and/or preparation of an effector of the cleavage/polyadenylation of 
precursor mRNA comprising the step of bringing into contact a product of any of claims 1 
to 7 with a compound, a mixture or a library of compounds and determining whether the 
compound or a certain compound of the mixture or library binds to the product and/or 
affects the product's biological activity, and optionally further purifying the compound 
that tested positive as an effector. 

In another embodiment, the present invention is directed to a method for 
screening for a molecule that binds a protein complex of the present invention 
comprising exposing said complex, or a cell or organism containing the complex 
machinery, to one or more candidate molecules; and determining whether said complex 
is bound by any of said candidate molecules. Such screening assays can be carried out 
using cell-free and cell-based methods that are commonly known in the art in vitro, in 
vivo or ex vivo. For example, an isolated complex can be employed, or a cell can be 
contacted with the candidate molecule and the complex can be isolated from such 
contacted cells and the isolated complex can be assayed for activity or component 
composition. In another example, a cell containing the complex can be contacted with 
the candidate molecule and the levels of the complex in the contacted cell can be 
measured. Additionally, such assays can be carried out in cells recombinantly 
expressing a component protein from column A of table 1 of a given row, or a 
functionally active fragment or functionally active derivative thereof, and a component 
protein from column B of table 1 of said row, or a functionally active fragment or 
functionally active derivative thereof. Additionally, such assays can also be carried out in 
cells recombinantly expressing all component proteins from the group of proteins in 
column C of table 1. 

For example, assays can be carried out using recombinant cells expressing the 
protein components of a complex, to screen for molecules that bind to, or interfere with, 
or promote complex activity or formation. In preferred embodiments, polypeptide 
derivatives that have superior stabilities but retain the ability to form a complex (e.g., one 
or more component proteins modified to be resistant to proteolytic degradation in the 
binding assay buffers, or to be resistant to oxidative degradation), are used to screen for 
modulators of complex activity or formation. Such resistant molecules can be generated, 
e.g., by substitution of amino acids at proteolytic cleavage sites, the use of chemically 
derivatized amino acids at proteolytic susceptible sites, and the replacement of amino 
acid residues subject to oxidation, i.e. methionine and cysteine. 

A particular aspect of the present invention relates to identifying molecules that 
inhibit or promote formation or degradation of a complex of the present invention, e.g., 
using the method described for isolating the complex and identifying members of the 
complex using the TAP assay described in Section 6, infra, and in WO 00/09716 and 
Rigaut et al., 1999, Nature Biotechnology 17:1030-1032, which are each incorporated by 
reference in their entirety. 

In another embodiment of the invention, a modulator is identified by administering 
a candidate molecule to a transgenic non-human animal expressing the complex 
component proteins from promoters that are not the native promoters of the respective 
proteins, more preferably where the candidate molecule is also recombinantly expressed 
in the transgenic non-human animal. Alternatively, the method for identifying such a 
modulator can be carried out in vitro, preferably with a purified complex, and a purified 
candidate molecule. 

Agents/molecules (candidate molecules) to be screened can be provided as 
mixtures of a limited number of specified compounds, or as compound libraries, peptide 
libraries and the like. Agents/molecules to be screened may also include all forms of 
antisera, antisense nucleic acids, etc., that can modulate complex activity or formation. 
Exemplary candidate molecules and libraries for screening are set forth in Section 5.6.1, 
infra. 

Screening the libraries can be accomplished by any of a variety of commonly 
known methods. See, e.g., the following references, which disclose screening of peptide 
libraries: Parmley and Smith, 1989, Adv. Exp. Med. Biol. 251:215-218; Scott and Smith, 
1990, Science 249:386-390; Fowlkes et al., 1992, BioTechniques 13:422-427; Oldenburg 
et al., 1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al., 1994, Cell 76:933-945; 
Staudt et al., 1988, Science 241:577-580; Bock et al., 1992, Nature 355:564-566; Tuerk 
et al., 1992, Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al., 1992, Nature 
355:850-852; U.S. Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent 
No. 5,198,346, all to Ladner et al.; Rebar and Pabo, 1993, Science 263:671-673; and 
International Patent Publication No. WO 94/18318. 

In a specific embodiment, screening can be carried out by contacting the library 
members with a complex immobilized on a solid phase, and harvesting those library 
members that bind to the protein (or encoding nucleic acid or derivative). Examples of 
such screening methods, termed "panning" techniques, are described by way of example 
in Parmley and Smith, 1988, Gene 73:305-318; Fowlkes et al., 1992, BioTechniques 
13:422-427; International Patent Publication No. WO 94/18318; and in references cited 
hereinabove. 

In a specific embodiment, fragments and/or analogs of protein components of a 
complex, especially peptidomimetics, are screened for activity as competitive or 
non-competitive inhibitors of complex formation (amount of complex or composition of 
complex) or activity in the cell, which thereby inhibit complex activity or formation in the 
cell. 

In one embodiment, agents that modulate (i.e., antagonize or agonize) complex 
activity or formation can be screened for using a binding inhibition assay, wherein agents 
are screened for their ability to modulate formation of a complex under aqueous, or 
physiological, binding conditions in which complex formation occurs in the absence of the 
agent to be tested. Agents that interfere with the formation of complexes of the invention 
are identified as antagonists of complex formation. Agents that promote the formation of 
complexes are identified as agonists of complex formation. Agents that completely block 
the formation of complexes are identified as inhibitors of complex formation. 
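
The three categories above can be expressed as a simple comparison of complex formation with and without the test agent. The Python sketch below is a minimal illustration under stated assumptions: the function name, the background correction, and the 10% tolerance used to decide "no change" and "complete block" are hypothetical choices for this example and are not values given in this specification.

```python
# Illustrative sketch only: thresholds and names are assumptions, not disclosed values.
def classify_agent(signal_with_agent: float,
                   signal_without_agent: float,
                   background: float = 0.0,
                   tolerance: float = 0.1) -> str:
    """Classify a test agent by its effect on complex formation.

    signal_without_agent is the complex-formation readout of the agent-free
    control; background is the readout with no complex present.
    """
    control = signal_without_agent - background
    if control <= 0:
        raise ValueError("control must show complex formation above background")
    ratio = (signal_with_agent - background) / control
    if ratio <= tolerance:
        return "inhibitor"      # formation essentially completely blocked
    if ratio < 1 - tolerance:
        return "antagonist"     # formation reduced but not abolished
    if ratio > 1 + tolerance:
        return "agonist"        # formation promoted
    return "no effect"          # within tolerance of the control


for agent_signal in (5.0, 50.0, 100.0, 150.0):
    print(agent_signal, classify_agent(agent_signal, 100.0))
```

A real assay would replace the fixed tolerance with a threshold derived from replicate measurements of the agent-free control.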

Methods for screening may involve labeling the component proteins of the 
complex with radioligands (e.g., 125I or 3H), magnetic ligands (e.g., paramagnetic beads 
covalently attached to photobiotin acetate), fluorescent ligands (e.g., fluorescein or 
rhodamine), or enzyme ligands (e.g., luciferase or beta-galactosidase). The reactants 
that bind in solution can then be isolated by one of many techniques known in the art, 
including but not restricted to, co-immunoprecipitation of the labeled complex moiety 
using antisera against the unlabeled binding partner (or labeled binding partner with a 
distinguishable marker from that used on the second labeled complex moiety), 
immunoaffinity chromatography, size exclusion chromatography, and gradient density 
centrifugation. In a preferred embodiment, the labeled binding partner is a small 
fragment or peptidomimetic that is not retained by a commercially available filter. Upon 
binding, the labeled species is then unable to pass through the filter, providing for a 
simple assay of complex formation. 

Methods commonly known in the art are used to label at least one of the 
component members of the complex. Suitable labeling methods include, but are not 
limited to, radiolabeling by incorporation of radiolabeled amino acids, e.g., 3H-leucine or 
35S-methionine, radiolabeling by post-translational iodination with 125I or 131I using the 
chloramine T method, Bolton-Hunter reagents, etc., or labeling with 32P using 
phosphorylase and inorganic radiolabeled phosphorus, biotin labeling with 
photobiotin-acetate and sunlamp exposure, etc. In cases where one of the members of the complex 
is immobilized, e.g., as described infra, the free species is labeled. Where neither of the 
interacting species is immobilized, each can be labeled with a distinguishable marker 
such that isolation of both moieties can be followed to provide for more accurate 
quantification, and to distinguish the formation of homomeric from heteromeric 
complexes. Methods that utilize accessory proteins that bind to one of the modified 
interactants to improve the sensitivity of detection, increase the stability of the complex, 
etc., are provided. 

Typical binding conditions are, for example, but not by way of limitation, in an 
aqueous salt solution of 10-250 mM NaCl, 5-50 mM Tris-HCl, pH 5-8, and 0.5% Triton 
X-100 or other detergent that improves specificity of interaction. Metal chelators and/or 
divalent cations may be added to improve binding and/or reduce proteolysis. Reaction 
temperatures may include 4, 10, 15, 22, 25, 35, or 42 degrees Celsius, and time of 
incubation is typically at least 15 seconds, but longer times are preferred to allow binding 
to reach equilibrium. Particular complexes can be assayed using routine protein binding 
assays to determine optimal binding conditions for reproducible binding. 
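
As a minimal sketch of how such an optimization could be organized, the following Python snippet enumerates a grid of candidate binding conditions within the ranges given above. The specific grid points and the names BindingCondition and condition_grid are illustrative assumptions, not values or identifiers prescribed by this disclosure.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class BindingCondition:
    """One candidate set of binding conditions (hypothetical container)."""
    nacl_mm: float
    tris_mm: float
    ph: float
    temperature_c: float


def condition_grid(nacl_mm, tris_mm, ph, temperature_c):
    """Return every combination of the supplied condition values."""
    return [
        BindingCondition(*combo)
        for combo in product(nacl_mm, tris_mm, ph, temperature_c)
    ]


# Illustrative grid points chosen inside the ranges stated in the text.
grid = condition_grid(
    nacl_mm=[10, 50, 150, 250],
    tris_mm=[5, 20, 50],
    ph=[5.0, 6.5, 8.0],
    temperature_c=[4, 22, 35],
)
print(len(grid))  # 4 * 3 * 3 * 3 = 108 candidate conditions
```

Each entry of the grid can then be tested in a routine binding assay, and the condition giving the most reproducible signal retained.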

The physical parameters of complex formation can be analyzed by quantification 
of complex formation using assay methods specific for the label used, e.g., liquid 
scintillation counting for radioactivity detection, enzyme activity for enzyme-labeled 
moieties, etc. The reaction results are then analyzed utilizing Scatchard analysis, Hill 
analysis, and other methods commonly known in the art (see, e.g., Proteins, Structures, 
and Molecular Principles, 2nd Edition (1993) Creighton, Ed., W.H. Freeman and 
Company, New York). 
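
By way of illustration only, the following Python sketch shows how a Scatchard analysis and a Hill analysis could be computed from measured free and bound ligand concentrations. It assumes a simple single-site binding model and noise-free synthetic data; the function names and the example values (Kd = 10, Bmax = 100, arbitrary concentration units) are hypothetical and are not taken from this disclosure.

```python
import math


def linear_fit(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def scatchard(free, bound):
    """Scatchard plot: bound/free versus bound has slope -1/Kd and intercept Bmax/Kd."""
    ratios = [b / f for b, f in zip(bound, free)]
    slope, intercept = linear_fit(bound, ratios)
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


def hill_coefficient(free, bound, bmax):
    """Hill plot: log(theta / (1 - theta)) versus log(free) has slope n."""
    xs = [math.log10(f) for f in free]
    ys = [math.log10((b / bmax) / (1.0 - b / bmax)) for b in bound]
    slope, _ = linear_fit(xs, ys)
    return slope


# Synthetic, noise-free data for a single-site interaction (Kd = 10, Bmax = 100).
free = [1.0, 2.0, 5.0, 10.0, 20.0, 50.0]
bound = [100.0 * f / (10.0 + f) for f in free]
kd, bmax = scatchard(free, bound)
n_hill = hill_coefficient(free, bound, bmax)
```

With real data the points scatter around the fitted lines, and a Hill coefficient that departs from 1 suggests cooperative binding.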

In a second common approach to binding assays, one of the binding species is 
immobilized on a filter, in a microtiter plate well, in a test tube, to a chromatography 
matrix, etc., either covalently or non-covalently. Proteins can be covalently immobilized 
using any method well known in the art, for example, but not limited to the method of 
Kadonaga and Tjian, 1986, Proc. Natl. Acad. Sci. USA 83:5889-5893, i.e., linkage to a 
cyanogen-bromide derivatized substrate such as CNBr-Sepharose 4B (Pharmacia). 
Where needed, the use of spacers can reduce steric hindrance by the substrate. 
Non-covalent attachment of proteins to a substrate includes, but is not limited to, attachment 
of a protein to a charged surface, binding with specific antibodies, binding to a third 
unrelated interacting protein, etc. 

Assays of agents (including cell extracts or a library pool) for competition for 
binding of one member of a complex (or derivatives thereof) with another member of the 
complex labeled by any means (e.g., those means described above) are provided to 
screen for competitors or enhancers of complex formation. 
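
The readout of such a competition assay can be reduced to a percent inhibition and a half-maximal inhibitory concentration. The following Python sketch is one possible way to do this, assuming that the bound label has been measured at increasing concentrations of a test agent; the function names, the log-linear interpolation, and the example numbers are illustrative assumptions.

```python
import math


def percent_inhibition(signal, no_competitor_signal, background=0.0):
    """Percent loss of bound label relative to the reaction without test agent.

    Negative values indicate that the agent enhanced complex formation.
    """
    span = no_competitor_signal - background
    return 100.0 * (1.0 - (signal - background) / span)


def ic50_by_interpolation(concentrations, inhibitions):
    """Estimate the concentration giving 50% inhibition.

    Concentrations must be positive and ascending. Interpolation is linear in
    log10(concentration); None is returned if 50% is never bracketed.
    """
    points = list(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi != i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


# Hypothetical titration of a competitor (arbitrary concentration units).
concentrations = [1.0, 10.0, 100.0]
inhibitions = [10.0, 50.0, 90.0]
ic50 = ic50_by_interpolation(concentrations, inhibitions)
```

An agent that yields a negative percent inhibition across the titration would be scored as an enhancer rather than a competitor.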

In specific embodiments, blocking agents to inhibit non-specific binding of 
reagents to other protein components, or absorptive losses of reagents to plastics, 
immobilization matrices, etc., are included in the assay mixture. Blocking agents include, 
but are not restricted to bovine serum albumin, beta-casein, nonfat dried milk, Denhardt's 
reagent, Ficoll, polyvinylpyrrolidone, nonionic detergents (NP40, Triton X-100, Tween 20, 
Tween 80, etc.), ionic detergents (e.g., SDS, LDS, etc.), polyethylene glycol, etc. 
Appropriate blocking agent concentrations allow complex formation. 

After binding is performed, unbound, labeled protein is removed in the 
supernatant, and the immobilized protein retaining any bound, labeled protein is washed 
extensively. The amount of bound label is then quantified using standard methods in the 
art to detect the label as described, supra. 

Moreover, a number of polyadenylation assays are described in the prior art. 

Such assays can be found in Bienroth, S.; Wahle, E.; Suter-Crazzolara, C. and Keller, 
W. (1991), J. Biol. Chem. 266, 19768-19776; Edwards-Gilbert, G. and Milcarek, C. 
(1995), Mol. Cell. Biol. 15, 6420-6429; Wahle, E. (1991), Cell 66, 759-768; Christofori, G. 
and Keller, W. (1988), Cell 54, 875-889. 

Exemplary assays useful to measure the 3'-end processing activity for 
mRNA of complex 162 include, but are not limited to, those described in Kessler, M.M. et 
al., 1996, J. Biol. Chem. 271: 27167-75; Butler, S.J. and Platt, T. (1988), Science 
242, 1270-1274; and Moore, C.L. and Sharp, P.A. (1985), Cell 41, 845-855. 

Exemplary assays useful to measure the cleavage step in 3'-end 
processing activity of mRNA of complex 162 include, but are not limited to, those 
described in Ruegsegger, U. et al., 1996, J. Biol. Chem. 271: 6107-6113. 

An exemplary RNA binding assay can be carried out by contacting a complex having 
RNA binding activity with a radioactive [32P] end-labeled RNA substrate, e.g., a poly(A) 
RNA, under appropriate conditions and detecting bound protein. The detection of bound 
protein can be carried out, e.g., by filtering the solution through a nitrocellulose filter and 
determining the radioactivity bound to the filter. This assay is based on the retention of 
nucleic acid-protein complexes on nitrocellulose, whereas free nucleic acid can pass 
through the filter (see, e.g., Wahle, E., 1991, Cell 66: 759-68). 
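
The filter-bound radioactivity can be converted into a fraction of RNA bound. The following Python sketch assumes that three counts are available: the counts retained on the filter, the total counts applied, and the counts retained in a control reaction lacking protein. The function name and the example counts are hypothetical.

```python
def fraction_bound(filter_cpm, input_cpm, no_protein_cpm=0.0):
    """Fraction of input RNA retained on the filter as protein-RNA complex.

    filter_cpm     -- counts retained on the filter in the binding reaction
    input_cpm      -- total counts of labeled RNA applied to the filter
    no_protein_cpm -- counts retained in a control reaction without protein
    """
    if input_cpm <= 0:
        raise ValueError("input_cpm must be positive")
    retained = max(filter_cpm - no_protein_cpm, 0.0)
    return retained / input_cpm


# Hypothetical counts per minute from a single binding reaction.
example = fraction_bound(filter_cpm=5200.0, input_cpm=10000.0, no_protein_cpm=200.0)
print(example)  # 0.5
```

Repeating the calculation over a range of protein concentrations gives the binding curve used in the quantitative analyses described above.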

An exemplary RNA exonuclease assay can be carried out by contacting a complex 
having RNA exonuclease activity with a radioactive [32P] end-labeled RNA 
substrate under appropriate conditions and detecting the release of free radioactive 
nucleotides. The detection of free radioactive nucleotides can be carried out, e.g., by 
adding 20% trichloroacetic acid, filtering the solution through a filter and measuring the 
amount of acid-soluble radioactivity 
(see, e.g., Ross, J., 1999, Methods 17: 52-9). 

An exemplary mRNA splicing assay can be carried out by contacting a complex having 
mRNA splicing activity with a radioactively labeled RNA substrate under appropriate 
conditions and detecting the release of spliced RNA species. The detection of spliced 
RNA species can be carried out, e.g., by fractionation of processed RNAs in a glycerol 
gradient and subsequent analysis by denaturing polyacrylamide gel electrophoresis and 
visualization by autoradiography 
(see, e.g., Schwer, B. and Gross, C.H., 1998, Methods 17: 2086-94). 

An exemplary rRNA processing assay can be carried out by contacting a complex having 
rRNA processing activity with a pre-rRNA substrate under appropriate conditions and 
detecting the release of free processed rRNA species. The detection of processed rRNA 
species can be carried out, e.g., using a primer extension or northern blotting assay by 
measuring the size of the rRNA species 
(see, e.g., Kressler, D. et al., 1997, Methods 17: 7283-94). 



5.6.1. CANDIDATE MOLECULES 

Any molecule known in the art can be tested for its ability to modulate (increase or 
decrease) the amount of, activity of, or protein component composition of a complex of 
the present invention as detected by a change in the amount of, activity of, or protein 
component composition of, said complex. By way of example, a change in the amount of 
the complex can be detected by detecting a change in the amount of the complex that 
can be isolated from a cell expressing the complex machinery. For identifying a 
molecule that modulates complex activity, candidate molecules can be directly provided 
to a cell expressing the complex machinery, or, in the case of candidate proteins, can be 
provided by providing their encoding nucleic acids under conditions in which the nucleic 
acids are recombinantly expressed to produce the candidate proteins within the cell 
expressing the complex machinery; the complex is then isolated from the cell and the 
isolated complex is assayed for activity using methods well known in the art, not limited 
to those described, supra. 

This embodiment of the invention is well suited to screen chemical libraries for 
molecules which modulate, e.g., inhibit, antagonize, or agonize, the amount of, activity 
of, or protein component composition of the complex. The chemical libraries can be 
peptide libraries, peptidomimetic libraries, chemically synthesized libraries, recombinant, 
e.g., phage display libraries, and in vitro translation-based libraries, other non-peptide 
synthetic organic libraries, etc. 

Exemplary libraries are commercially available from several sources (ArQule, 
Tripos/PanLabs, ChemDesign, Pharmacopoeia). In some cases, these chemical 
libraries are generated using combinatorial strategies that encode the identity of each 
member of the library on a substrate to which the member compound is attached, thus 
allowing direct and immediate identification of a molecule that is an effective modulator. 
Thus, in many combinatorial approaches, the position on a plate of a compound specifies 
that compound’s composition. Also, in one example, a single plate position may have 
from 1-20 chemicals that can be screened by administration to a well containing the 
interactions of interest. Thus, if modulation is detected, smaller and smaller pools of 
interacting pairs can be assayed for the modulation activity. By such methods, many 
candidate molecules can be screened. 
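
The narrowing of an active pool to its active members can be sketched as a recursive halving procedure. In the following Python example, is_active is a hypothetical callable standing in for the laboratory assay, and the sketch assumes that a pool scores as active whenever it contains at least one active compound (i.e., no masking between pool members).

```python
def deconvolve_pool(compounds, is_active):
    """Return the active members of a pool by assaying successively smaller sub-pools."""
    pool = list(compounds)
    if not pool or not is_active(pool):
        return []          # an inactive pool is discarded without further assays
    if len(pool) == 1:
        return pool        # a single active compound has been isolated
    mid = len(pool) // 2
    return (deconvolve_pool(pool[:mid], is_active)
            + deconvolve_pool(pool[mid:], is_active))


# Hypothetical pool of 20 compounds, two of which modulate the complex.
compounds = [f"cpd{i}" for i in range(20)]
true_actives = {"cpd3", "cpd17"}


def mock_assay(pool):
    """Stand-in for the assay: active if any member is a true active."""
    return any(c in true_actives for c in pool)


hits = deconvolve_pool(compounds, mock_assay)
print(hits)  # ['cpd3', 'cpd17']
```

Because inactive sub-pools are dropped at each step, far fewer assays are needed than when every compound is tested individually.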

Many diversity libraries suitable for use are known in the art and can be used to 
provide compounds to be tested according to the present invention. Alternatively, 
libraries can be constructed using standard methods. Chemical (synthetic) libraries, 
recombinant expression libraries, or polysome-based libraries are exemplary types of 
libraries that can be used. 

The libraries can be constrained or semirigid (having some degree of structural 
rigidity), or linear or nonconstrained. The library can be a cDNA or genomic expression 
library, random peptide expression library or a chemically synthesized random peptide 
library, or non-peptide library. Expression libraries are introduced into the cells in which 
the assay occurs, where the nucleic acids of the library are expressed to produce their 
encoded proteins. 

In one embodiment, peptide libraries that can be used in the present invention 
may be libraries that are chemically synthesized in vitro. Examples of such libraries are 
given in Houghten et al., 1991, Nature 354:84-86, which describes mixtures of free 
hexapeptides in which the first and second residues in each peptide were individually 
and specifically defined; Lam et al., 1991, Nature 354:82-84, which describes a "one 
bead, one peptide" approach in which a solid phase split synthesis scheme produced a 
library of peptides in which each bead in the collection had immobilized thereon a single, 
random sequence of amino acid residues; Medynski, 1994, Bio/Technology 12:709-710, 
which describes split synthesis and T-bag synthesis methods; and Gallop et al., 1994, J. 
Medicinal Chemistry 37(9):1233-1251. Simply by way of other examples, a 
combinatorial library may be prepared for use, according to the methods of Ohlmeyer et 
al., 1993, Proc. Natl. Acad. Sci. USA 90:10922-10926; Erb et al., 1994, Proc. Natl. Acad. 
Sci. USA 91:11422-11426; Houghten et al., 1992, Biotechniques 13:412; Jayawickreme 
et al., 1994, Proc. Natl. Acad. Sci. USA 91:1614-1618; or Salmon et al., 1993, Proc. Natl. 
Acad. Sci. USA 90:11708-11712. PCT Publication No. WO 93/20242 and Brenner and 
Lerner, 1992, Proc. Natl. Acad. Sci. USA 89:5381-5383 describe "encoded combinatorial 
chemical libraries," that contain oligonucleotide identifiers for each chemical polymer 
library member. 

In a preferred embodiment, the library screened is a biological expression library 
that is a random peptide phage display library, where the random peptides are 
constrained (e.g., by virtue of having disulfide bonding). 

Further, more general, structurally constrained, organic diversity (e.g., 
nonpeptide) libraries, can also be used. By way of example, a benzodiazepine library 
(see, e.g., Bunin et al., 1994, Proc. Natl. Acad. Sci. USA 91:4708-4712) may be used. 

Conformationally constrained libraries that can be used include but are not limited 
to those containing invariant cysteine residues which, in an oxidizing environment, cross-link 
by disulfide bonds to form cystines, modified peptides (e.g., peptides that incorporate fluorine, 
metals, or isotopic labels, or that are phosphorylated, etc.), peptides containing one or more 
non-naturally occurring amino acids, non-peptide structures, and peptides containing a 
significant fraction of γ-carboxyglutamic acid. 

Libraries of non-peptides, e.g., peptide derivatives (for example, that contain one 
or more non-naturally occurring amino acids) can also be used. One example of these 
are peptoid libraries (Simon et al., 1992, Proc. Natl. Acad. Sci. USA 89:9367-9371). 
Peptoids are polymers of non-natural amino acids that have naturally occurring side 
chains attached not to the alpha carbon but to the backbone amino nitrogen. Since 
peptoids are not easily degraded by human digestive enzymes, they are advantageously 
more easily adaptable to drug use. Another example of a library that can be used, in 
which the amide functionalities in peptides have been permethylated to generate a 
chemically transformed combinatorial library, is described by Ostresh et al., 1994, Proc. 
Natl. Acad. Sci. USA 91:11138-11142. 

The members of the peptide libraries that can be screened according to the 
invention are not limited to containing the 20 naturally occurring amino acids. In 
particular, chemically synthesized libraries and polysome based libraries allow the use of 
amino acids in addition to the 20 naturally occurring amino acids (by their inclusion in the 
precursor pool of amino acids used in library production). In specific embodiments, the 
library members contain one or more non-natural or non-classical amino acids or cyclic 
peptides. Non-classical amino acids include but are not limited to the D-isomers of the 
common amino acids, α-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric 
acid, γ-Abu, ε-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic 
acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, cysteic acid, 
t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, β-alanine, designer amino 
acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, 
fluoro-amino acids and amino acid analogs in general. Furthermore, the amino acid can 
be D (dextrorotatory) or L (levorotatory). 

In a specific embodiment, fragments and/or analogs of complexes of the 
invention, or protein components thereof, especially peptidomimetics, are screened for 
activity as competitive or non-competitive inhibitors of complex activity or formation. 

In another embodiment of the present invention, combinatorial chemistry can be 
used to identify modulators of the complexes. Combinatorial chemistry is capable of 
creating libraries containing hundreds of thousands of compounds, many of which may 
be structurally similar. While high throughput screening programs are capable of 
screening these vast libraries for affinity for known targets, new approaches have been 
developed that achieve libraries of smaller dimension but which provide maximum 
chemical diversity (see, e.g., Matter, 1997, Journal of Medicinal Chemistry 40:1219-1229). 

One method of combinatorial chemistry, affinity fingerprinting, has previously 
been used to test a discrete library of small molecules for binding affinities for a defined 
panel of proteins. The fingerprints obtained by the screen are used to predict the affinity 
of the individual library members for other proteins or receptors of interest (in the instant 
invention, the protein complexes of the present invention and protein components 
thereof). The fingerprints are compared with fingerprints obtained from other compounds 
known to react with the protein of interest to predict whether the library compound might 
similarly react. For example, rather than testing every ligand in a large library for 
interaction with a complex or protein component, only those ligands having a fingerprint 
similar to other compounds known to have that activity could be tested. (See, e.g., 
Kauvar et al., 1995, Chemistry and Biology 2:107-118; Kauvar, 1995, Affinity 
fingerprinting, Pharmaceutical Manufacturing International 8:25-28; and Kauvar, 
Toxic-Chemical Detection by Pattern Recognition, in New Frontiers in Agrochemical 
Immunoassay, D. Kurtz, L. Stanker and J.H. Skerritt, Editors, 1995, AOAC: Washington, 
D.C., 305-312.) 
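
The fingerprint comparison described above can be sketched as a similarity search. The following Python example is a simplified illustration, not the method of the cited references: it assumes that each fingerprint is a vector of measured affinities for the same reference panel of proteins, uses the Pearson correlation as the similarity measure, and applies an arbitrary threshold of 0.9. All names and values are hypothetical.

```python
import math


def pearson(a, b):
    """Pearson correlation between two equal-length fingerprints."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0  # a flat fingerprint carries no pattern to compare
    return cov / math.sqrt(var_a * var_b)


def select_candidates(library, known_binders, threshold=0.9):
    """Return names of library compounds whose fingerprint resembles any known binder."""
    selected = []
    for name, fingerprint in library.items():
        best = max(pearson(fingerprint, ref) for ref in known_binders)
        if best >= threshold:
            selected.append(name)
    return selected


# Hypothetical affinities of each compound for a four-protein reference panel.
known_binders = [[0.9, 0.1, 0.5, 0.2]]
library = {
    "compound_A": [0.8, 0.2, 0.5, 0.3],  # profile similar to the known binder
    "compound_B": [0.1, 0.9, 0.2, 0.8],  # profile unlike the known binder
}
candidates = select_candidates(library, known_binders)
```

Only the selected candidates would then be carried forward into a direct binding or activity assay with the complex.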

Kay et al., 1993, Gene 128:59-65 (Kay) discloses a method of constructing 
peptide libraries that encode peptides of totally random sequence that are longer than 
those of any prior conventional libraries. The libraries disclosed in Kay encode totally 
synthetic random peptides of greater than about 20 amino acids in length. Such libraries 
can be advantageously screened to identify complex modulators. (See also U.S. Patent 
No. 5,498,538 dated March 12, 1996; and PCT Publication No. WO 94/18318 dated 
August 18, 1994). 

A comprehensive review of various types of peptide libraries can be found in 
Gallop et al., 1994, J. Med. Chem. 37:1233-1251. 

5.7. PHARMACEUTICAL COMPOSITIONS AND THERAPEUTIC/PROPHYLACTIC 
ADMINISTRATION 

The invention provides methods of treatment (and prophylaxis) by administration 
to a subject of an effective amount of a Therapeutic of the invention. In a preferred 
aspect, the Therapeutic is substantially purified. The subject is preferably an animal 
including, but not limited to animals such as cows, pigs, horses, chickens, cats, dogs, 
etc., and is preferably a mammal, and most preferably human. In a specific embodiment, 
a non-human mammal is the subject. 

Various delivery systems are known and can be used to administer a Therapeutic 
of the invention, e.g., encapsulation in liposomes, microparticles, and microcapsules; use 
of recombinant cells capable of expressing the Therapeutic; use of receptor-mediated 
endocytosis (e.g., Wu and Wu, 1987, J. Biol. Chem. 262:4429-4432); construction of a 
Therapeutic nucleic acid as part of a retroviral or other vector, etc. Methods of 
introduction include but are not limited to intradermal, intramuscular, intraperitoneal, 
intravenous, subcutaneous, intranasal, epidural, and oral routes. The compounds may 
be administered by any convenient route, for example by infusion, by bolus injection, by 
absorption through epithelial or mucocutaneous linings (e.g., oral, rectal and intestinal 
mucosa, etc.), and may be administered together with other biologically active agents. 
Administration can be systemic or local. In addition, it may be desirable to introduce the 
pharmaceutical compositions of the invention into the central nervous system by any 
suitable route, including intraventricular and intrathecal injection; intraventricular injection 
may be facilitated by an intraventricular catheter, for example, attached to a reservoir, 
such as an Ommaya reservoir. Pulmonary administration can also be employed, e.g., by 
use of an inhaler or nebulizer, and formulation with an aerosolizing agent. 

In a specific embodiment, it may be desirable to administer the pharmaceutical 
compositions of the invention locally to the area in need of treatment. This may be 
achieved by, for example, and not by way of limitation, local infusion during surgery, 
topical application, e.g., in conjunction with a wound dressing after surgery, by injection, 
by means of a catheter, by means of a suppository, or by means of an implant, said 
implant being of a porous, non-porous, or gelatinous material, including membranes, 
such as silastic membranes, or fibers. In one embodiment, administration can be by 
direct injection at the site (or former site) of a malignant tumor or neoplastic or 
pre-neoplastic tissue. 

In another embodiment, the Therapeutic can be delivered in a vesicle, in 
particular a liposome (Langer, 1990, Science 249:1527-1533; Treat et al., 1989, In: 
Liposomes in the Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler, 
eds., Liss, New York, pp. 353-365; Lopez-Berestein, ibid., pp. 317-327; see generally 
ibid.). 

In yet another embodiment, the Therapeutic can be delivered via a controlled 
release system. In one embodiment, a pump may be used (Langer, supra; Sefton, 1987, 
CRC Crit. Rev. Biomed. Eng. 14:201-240; Buchwald et al., 1980, Surgery 88:507-516; 
Saudek et al., 1989, N. Engl. J. Med. 321:574-579). In another embodiment, polymeric 
materials can be used (Medical Applications of Controlled Release, Langer and Wise, 
eds., CRC Press, Boca Raton, Florida, 1974; Controlled Drug Bioavailability, Drug 
Product Design and Performance, Smolen and Ball, eds., Wiley, New York, 1984; 
Langer and Peppas, 1983, J. Macromol. Sci. Rev. Macromol. Chem. 23:61; Levy et al., 
1985, Science 228:190-192; During et al., 1989, Ann. Neurol. 25:351-356; Howard et al., 
1989, J. Neurosurg. 71:858-863). In yet another embodiment, a controlled release 
system can be placed in proximity of the therapeutic target, i.e., the brain, thus requiring 
only a fraction of the systemic dose (e.g., Goodson, 1984, In: Medical Applications of 
Controlled Release, supra, Vol. 2, pp. 115-138). Other controlled release systems are 
discussed in the review by Langer (1990, Science 249:1527-1533). 

In a specific embodiment where the Therapeutic is a nucleic acid encoding a 
protein Therapeutic, the nucleic acid can be administered in vivo to promote expression 
of its encoded protein, by constructing it as part of an appropriate nucleic acid expression 
vector and administering it so that it becomes intracellular, e.g., by use of a retroviral 
vector (U.S. Patent No. 4,980,286), or by direct injection, or by use of microparticle 
bombardment (e.g., a gene gun; Biolistic, Dupont), or by coating it with lipids, cell-surface 
receptors or transfecting agents, or by administering it in linkage to a homeobox-like 
peptide which is known to enter the nucleus (e.g., Joliot et al., 1991, Proc. Natl. Acad. 
Sci. USA 88:1864-1868), etc. Alternatively, a nucleic acid Therapeutic can be introduced 
intracellularly and incorporated by homologous recombination within host cell DNA for 
expression. 

The present invention also provides pharmaceutical compositions. Such 
compositions comprise a therapeutically effective amount of a Therapeutic, and a 
pharmaceutically acceptable carrier. In a specific embodiment, the term 
"pharmaceutically acceptable" means approved by a regulatory agency of the Federal or 
a state government or listed in the U.S. Pharmacopeia or other generally recognized 
pharmacopeia for use in animals, and more particularly, in humans. The term "carrier" 
refers to a diluent, adjuvant, excipient, or vehicle with which the therapeutic is 
administered. Such pharmaceutical carriers can be sterile liquids, such as water and 
oils, including those of petroleum, animal, vegetable or synthetic origin, including but not 
limited to peanut oil, soybean oil, mineral oil, sesame oil and the like. Water is a 
preferred carrier when the pharmaceutical composition is administered orally. Saline and 
aqueous dextrose are preferred carriers when the pharmaceutical composition is 
administered intravenously. Saline solutions and aqueous dextrose and glycerol 
solutions are preferably employed as liquid carriers for injectable solutions. Suitable 
pharmaceutical excipients include starch, glucose, lactose, sucrose, gelatin, malt, rice, 
flour, chalk, silica gel, sodium stearate, glycerol monostearate, talc, sodium chloride, 
dried skim milk, glycerol, propylene glycol, water, ethanol and the like. The composition, 
if desired, can also contain minor amounts of wetting or emulsifying agents, or pH 
buffering agents. These compositions can take the form of solutions, suspensions, 
emulsions, tablets, pills, capsules, powders, sustained-release formulations and the like. 
The composition can be formulated as a suppository, with traditional binders and carriers 
such as triglycerides. Oral formulation can include standard carriers such as 
pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, etc. Examples of suitable pharmaceutical 
carriers are described in "Remington's Pharmaceutical Sciences" by E.W. Martin. Such 
compositions will contain a therapeutically effective amount of the Therapeutic, 
preferably in purified form, together with a suitable amount of carrier so as to provide the 
form for proper administration to the patient. The formulation should suit the mode of 
administration. 

In a preferred embodiment, the composition is formulated, in accordance with 
routine procedures, as a pharmaceutical composition adapted for intravenous 
administration to human beings. Typically, compositions for intravenous administration 
are solutions in sterile isotonic aqueous buffer. Where necessary, the composition may 
also include a solubilizing agent and a local anesthetic such as lidocaine to ease pain at 
the site of the injection. Generally, the ingredients are supplied either separately or 
mixed together in unit dosage form, for example, as a dry lyophilized powder or water- 
free concentrate in a hermetically sealed container such as an ampoule or sachette 
indicating the quantity of active agent. Where the composition is to be administered by 
infusion, it can be dispensed with an infusion bottle containing sterile pharmaceutical 
grade water or saline. Where the composition is administered by injection, an ampoule 
of sterile water or saline for injection can be provided so that the ingredients may be 
mixed prior to administration. 

The Therapeutics of the invention can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amine groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., those 
formed with free carboxyl groups such as those derived from isopropylamine, triethylamine, 
2-ethylamino ethanol, histidine, procaine, etc., and those derived from sodium, 
potassium, ammonium, calcium, and ferric hydroxides, etc. 

The amount of the Therapeutic of the invention which will be effective in the 
treatment of a particular disorder or condition will depend on the nature of the disorder or 
condition, and can be determined by standard clinical techniques. In addition, in vitro 
assays may optionally be employed to help identify optimal dosage ranges. The precise 
dose to be employed in the formulation will also depend on the route of administration, 
and the seriousness of the disease or disorder, and should be decided according to the 
judgment of the practitioner and each patient's circumstances. However, suitable 
dosage ranges for intravenous administration are generally about 20-500 micrograms of 
active compound per kilogram body weight. Suitable dosage ranges for intranasal 
administration are generally about 0.01 pg/kg body weight to 1 mg/kg body weight. 
Effective doses may be extrapolated from dose-response curves derived from in vitro or 
animal model test systems. 
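
As a purely arithmetic illustration of the per-kilogram ranges stated above, the following Python sketch converts a dosage range expressed per kilogram of body weight into a total amount. The 70 kg body weight and the function name are assumed example values; the intravenous range of 20-500 micrograms per kilogram is the one given in the text.

```python
def dose_range_ug(body_weight_kg, low_ug_per_kg, high_ug_per_kg):
    """Return the (low, high) total dose in micrograms for a given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body_weight_kg must be positive")
    return body_weight_kg * low_ug_per_kg, body_weight_kg * high_ug_per_kg


# Intravenous range from the text (20-500 micrograms/kg), assumed 70 kg subject.
low_ug, high_ug = dose_range_ug(70.0, 20.0, 500.0)
print(low_ug / 1000.0, high_ug / 1000.0)  # 1.4 35.0 (milligrams)
```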

Suppositories generally contain active ingredient in the range of 0.5% to 10% by 
weight; oral formulations preferably contain 10% to 95% active ingredient. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the pharmaceutical compositions 
of the invention. Optionally associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the pharmaceutical compositions 
of the invention. For example, the kit can comprise in one or more containers a first 
protein, or a functionally active fragment or functionally active derivative thereof, which 
first protein is selected from the group consisting of 
proteins listed in column A of table 1 of a given row; and a second protein, or a 
functionally active fragment or functionally active derivative thereof, which second protein 
is selected from the group consisting of proteins listed in column B of table 1 of said row. 
Alternatively, the kit can comprise in one or more containers, all proteins, functionally 
active fragments or functionally active derivatives thereof from the group of proteins in 
column C of table 1. 

The kits of the present invention can also contain expression vectors encoding the 
essential components of the complex machinery, which components after being 
expressed can be reconstituted in order to form a biologically active complex. Such a kit 
preferably also contains the required buffers and reagents. Optionally associated with 
such container(s) can be instructions for use of the kit and/or a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

5.8. ANIMAL MODELS 

The present invention also provides animal models. In one embodiment, animal 
models for diseases and disorders involving the protein complexes of the present 
invention are provided. These animal models are well known in the art. These animal 
models include, but are not limited to those which are listed in the section 5.6 (supra) as 
exemplary animal models to study any of the complexes provided in the invention. Such 
animals can be initially produced by promoting homologous recombination or insertional 
mutagenesis between genes encoding the protein components of the complexes in the 
chromosome, and exogenous genes encoding the protein components of the complexes 
that have been rendered biologically inactive or deleted (preferably by insertion of a 
heterologous sequence, e.g., an antibiotic resistance gene). In a preferred aspect, 
homologous recombination is carried out by transforming embryo-derived stem (ES) cells 
with one or more vectors containing one or more insertionally inactivated genes, such 
that homologous recombination occurs, followed by injecting the transformed ES cells 
into a blastocyst, and implanting the blastocyst into a foster mother, followed by the birth 
of the chimeric animal ("knockout animal") in which a gene encoding a component 
protein from column A of table 1 of a given row, or a functionally active fragment or 
functionally active derivative thereof, and a gene encoding a component protein from 
column B of table 1 of said row, or a functionally active fragment or functionally active 
derivative thereof, has been inactivated or deleted (Capecchi, 1989, Science 
244:1288-1292). 

In another preferred aspect, homologous recombination is carried out by transforming 
embryo-derived stem (ES) cells with one or more vectors containing one or more 
insertionally inactivated genes, such that homologous recombination occurs, followed by 
injecting the transformed ES cells into a blastocyst, and implanting the blastocyst into a 
foster mother, followed by the birth of the chimeric animal ("knockout animal") in which 
the genes of all component proteins from the group of proteins listed in column C of table 
1 or of all proteins from the group of proteins listed in column D of table 1 have been 
inactivated or deleted. 

The chimeric animal can be bred to produce additional knockout animals. 
Such animals can be mice, hamsters, sheep, pigs, cattle, etc., and are preferably non- 
human mammals. In a specific embodiment, a knockout mouse is produced. 

Such knockout animals are expected to develop, or be predisposed to developing, 
diseases or disorders associated with mutations involving the protein complexes of the 
present invention, and thus, can have use as animal models of such diseases and 
disorders, e.g., to screen for or test molecules (e.g., potential therapeutics) for such 
diseases and disorders. 

In a different embodiment of the invention, transgenic animals that have 
incorporated and express (or over-express or mis-express) a functional gene encoding a 
protein component of the complex, e.g., by introducing a gene encoding one or more 
of the components of the complex under the control of a heterologous promoter (i.e., a 
promoter that is not the native promoter of the gene) that either over-expresses the 
protein or proteins, or expresses them in tissues not normally expressing the complexes 
or proteins, can have use as animal models of diseases and disorders characterized by 
elevated levels of the protein complexes. Such animals can be used to screen or test 
molecules for the ability to treat or prevent the diseases and disorders cited supra. 

In one embodiment, the present invention provides a recombinant non-human 
animal in which an endogenous gene encoding a first protein, or a functionally active 
fragment or functionally active derivative thereof, which first protein is selected 
from the group of proteins of column A of table 2 of a given complex, and an 
endogenous gene encoding a second protein, or a functionally active fragment or 
functionally active derivative thereof, which second protein is selected from the group 
consisting of proteins of column B of table 2 of said complex, have been deleted or 
inactivated by homologous recombination or insertional mutagenesis of said animal or an 
ancestor thereof. In addition, the present invention provides a recombinant non-human 
animal in which the endogenous genes of all proteins, or functionally active fragments or 
functionally active derivatives thereof, of one of the groups of proteins listed in column C 
have been deleted or inactivated by homologous recombination or insertional 
mutagenesis of said animal or an ancestor thereof. 

In another embodiment, the present invention provides a recombinant non-human 
animal in which an endogenous gene encoding a first protein, or a functionally active 
fragment or functionally active derivative thereof, which first protein is selected from the 
group consisting of proteins of column A of table 2 of a given complex, and an endogenous 
gene encoding a second protein, or a functionally active fragment or functionally active 
derivative thereof, which second protein is selected from the group consisting of proteins 
of column B of table 2 of said complex, are recombinantly expressed in said animal or an 
ancestor thereof. 

The following series of examples is presented by way of illustration and not by 
way of limitation on the scope of the invention. 



EXAMPLES 

By applying the process according to the invention to the isolation of the 
polyadenylation/cleavage machinery from yeast, which is further described below, thirty- 
two new proteins could be identified in said yeast complex. 

Purifications have been done using different proteins as bait according to the protocols 
stated further below. 

Below is a more detailed list of the newly identified components of the polyadenylation 
complex (see also Tab. 1). The Accession-Number stated is the GenBank-Accession 
number for the protein. 

Protein patterns for some of the purifications are shown in Figures 3 and 4. 

Act1: Is a known and essential protein (GenBank Acc. No. BAA21512.1), which has been 
shown to be involved in Pol II transcription and has been found to be associated with 
histone acetylation. It serves as a structural protein. 

Cka1: Is a known and non-essential protein (GenBank Acc. No. CAA86916.1), which has 
been found to be involved in Polymerase III transcription and has been found to be 
associated with the Casein kinase II complex. 

Eft2: The translation elongation factor EF-2 is a known protein involved in protein 
synthesis (GenBank Acc. No. AAB64827.1). 

Eno2: Is a known and essential protein (GenBank Acc. No. AAB68019.1). It has been 
shown to have lyase activity and is known to be involved in carbohydrate metabolism. 

Glc7 (YER133w) is also a known protein (GenBank Acc. No. AAC03231.1). It is also an 
essential protein and is a type I serine/threonine protein phosphatase which has been 
implicated in distinct cellular roles, such as carbohydrate metabolism, meiosis, mitosis 
and cell polarity. Its occurrence in the cleavage/polyadenylation machinery has not been 
known before. 

Gpm1: This protein is a phosphoglycerate mutase that converts 3-phosphoglycerate to 
2-phosphoglycerate in glycolysis. It is an essential protein (GenBank Acc. No. CAA81994.1). 

Hhf2: Is a known and non-essential protein (GenBank Acc. No. CAA95892.1) which has 
been shown to be involved in DNA-binding. It has previously been linked to the histone 
octamer and the RNA polymerase I upstream activation factor. 

Hta1: Is a known and non-essential protein (GenBank Acc. No. CAA88505.1) which has 
DNA-binding capability and has been shown to be involved in polymerase II transcription. 

Hsc82: Is a non-essential protein so far associated with protein folding (GenBank 
Acc. No. CAA89919.1). 

Imd2: Is an inosine-5'-monophosphate dehydrogenase so far associated with 
nucleotide metabolism. It is non-essential (GenBank Acc. No. AAB69728.1). 

Imd4: Is a non-essential protein with similarity to Imd2, so far associated with 
nucleotide metabolism (GenBank Acc. No. CAA86719.1). 

Met6: Is a homocysteine methyltransferase so far associated with amino-acid 
metabolism (GenBank Acc. No. AAB64646.1). 

Pdc1: Is pyruvate decarboxylase isozyme 1, so far associated with carbohydrate 
metabolism (GenBank Acc. No. CAA97573.1). 

Pfk1: Is a known protein (GenBank Acc. No. CAA97268.1) which has previously been 
described as part of the phosphofructokinase complex. 

Ref2 (YDR195w) is a known protein (GenBank Acc. No. CAA88708.1). It is a 
non-essential gene product. It has been shown to be involved in 3'-end formation prior to the 
final polyadenylation step. However, Ref2 has never been identified before as a 
component of the 3'-end processing machinery. Ref2 has been shown to interact with 
Glc7, another new component of the cleavage/polyadenylation machinery. 

Sec13: Is a known and essential protein (GenBank Acc. No. AAB67426.1). 

Sec31: Is a known and essential protein (GenBank Acc. No. CAA98772.1) 

Ssa3: Is a known and non-essential protein (GenBank Acc. No. CAA84896.1) which so 
far has been implicated in protein folding/protein transport. 

Ssu72 (YNL222w) is also a known protein (GenBank Acc. No. CAA96125.1) and is an 
essential, phylogenetically conserved protein which has been shown to interact with the 
general transcription factor TFIIB (Sua7). TFIIB is an essential component of the RNA 
polymerase II (RNAP II) core transcriptional machinery. It is thought that this interaction 
plays a role in the mechanism of start site selection by RNAP II. The finding according to 
the present invention that Ssu72 is associated with Pta1 is likely to be relevant since it is 
believed that mRNA 3'-end formation is linked with other nuclear processes like 
transcription, capping and splicing. Furthermore, Ssu72 has also been clearly identified 
in a "reverse tagging experiment", as explained herein below, by using some of the 
Pta1-associated proteins as bait. However, when Ssu72 itself was used as a bait, associated 
proteins were not found, most likely because the addition of a C-terminal tag 
renders Ssu72 non-functional. 

Taf60: Is a known and essential protein (GenBank Acc. No. CAA96819.1) which has 
been shown to be involved in Polymerase II transcription. 

Tkl1: Is a non-essential transketolase so far associated with amino-acid 
metabolism and carbohydrate metabolism (GenBank Acc. No. CAA89191.1). 

Tsa1: Translation initiation factor eIF5, which so far has been shown to catalyze 
hydrolysis of GTP on the 40S ribosomal subunit-initiation complex followed by joining to 
the 60S ribosomal subunit (GenBank Acc. No. CAA92145.1). 

Tye7: Is a known protein (GenBank Acc. No. CAA99671.1). It has been shown to be a 
basic helix-loop-helix transcription factor. 

Vid24: Is a known and non-essential protein (GenBank Acc. No. CAA89320.1) which has 
previously been associated with protein degradation and vesicular transport. 

Vps53: Is a known protein (GenBank Acc. No. CAA89320.1) which has been found to 
play a role in protein sorting. 

YCL046w: Is a non-essential protein (GenBank Acc. No. CAA42371.1). 

YGR156w is the protein product of an essential gene. This protein also contains an 
RNA-binding motif (GenBank Acc. No. CAA97170.1). 

YHL035c: Is a known and non-essential protein (GenBank Acc. No. AAB65047.1). It is a 
member of the ATP-binding cassette superfamily. 

YKL018w is also an essential protein containing a WD40 domain, which is a typical 
protein-binding domain (GenBank Acc. No. CAA81853.1). 

YLR221c: Is a protein of unknown function (GenBank Acc. No. AAB67410.1). 

YML030w: Is a protein of unknown function (GenBank Acc. No. CAA86625.1). 

YOR179c shows significant sequence similarity to Ysh1 (GenBank Acc. No. 
CAA99388.1). 

Two further proteins, for which binary interactions with known members of the 
polyadenylation complex have been shown before, have also been purified with the 
complex: 

YKL059c: Is the product of an essential gene and is a zinc-binding protein containing a 
C2HC zinc finger. The presence of this domain predicts an RNA-binding function for 
YKL059c. We believe the corresponding gene product is identical to Pfs1, a protein 
which has been mentioned in several publications, but which has never been annotated 
in the databases (for review see Keller, W. and Minvielle-Sebastia (1997). Curr Opin Cell 
Biol 11: 352-357) (GenBank Acc. No. CAA81896.1). 

Tif4632: Is a known and non-essential protein (GenBank Acc. No. CAA96751.1) which 
has been shown to have an RNA-binding/translation factor activity and is involved in 
protein synthesis. 



Below is a description of the experimental steps and protocols as used herein: 

The initial round of purification of the complex was carried out using Pta1 as a bait as 
described below: 

CONSTRUCTION OF A YEAST STRAIN EXPRESSING TAP-TAGGED Pta1 

The construction of these strains is illustrated both in Figure 2 and Table 5. 

PURIFICATION OF PROTEINS ASSOCIATED WITH PTA1 

The TAP technology, which is more fully described in WO/0009716 and in Rigaut, G. et 
al. (1999), Nature Biotechnology, Vol. 17 (10): 1030-1032, was used for 
protein complex purification. The Pta1 protein was C-terminally tagged with a TAP tag, 
which consists of a calmodulin-binding peptide (CBP), a cleavage site for TEV protease, 
followed by two IgG-binding units of protein A (Rigaut, G. et al. (1999), Nature 
Biotechnology, Vol. 17 (10): 1030-1032). Pta1 is an essential protein which has been 
reported to be a component of PF I. Pta1-TAP was used as a bait to identify associated 
partners from cell lysates using the two-step TAP purification procedure. Proteins were 
separated by 1D gel electrophoresis and visualized by staining with Coomassie. A total 
of more than 20 bands could be detected on the gel (see Fig. 3). The identity of the 
proteins was determined by mass spectrometry. Thirteen of these are known components of 
the pre-mRNA processing machinery: Cft1, Cft2, Ysh1, Pta1, Rna14, Pab1, Pcf11, Pap1, 
Clp1, Pfs2, Fip1, Rna15 and Yth1. It is to be noted that such a comprehensive set 
has never before been purified together in the form of a complex. The remaining seven 
proteins have not previously been found associated with Pta1: Ref2, YKL059c, 
YGR156w, YKL018w, Glc7, Ssu72 and YOR179c. 

VALIDATION OF INTERACTIONS FOUND WITH Ptal 

A reciprocal experiment to the one described above was performed. For this purpose, a 
subset of the interactors found in the above-described Pta1 purification (both known and 
novel interactors) was chosen as bait for a further round of purification (the baits used 
herein are listed in the first column of Table 1). In the case of some proteins, the 
C-terminally tagged versions could not be recovered. The likely reason for this is that the 
addition of the TAP tag at the C-terminus interferes with the function of these proteins. 
An important fact is that almost all of the known components involved in 3'-end formation 
and five of the seven novel proteins identified herein are essential for cell viability. The 
protein pattern obtained in some of those experiments is shown in Figure 4. 

The construction of the strains was carried out as described for the strain expressing the 
TAP-tagged Pta1. 



SEQUENCE ANALYSIS OF MEMBERS OF THE COMPLEX 

The process of mRNA processing is highly conserved in eukaryotes. Accordingly, human 
orthologues could be found for a number of the yeast proteins (see Table 2). This 
illustrates that many of the functions found in the yeast complex can be transferred to 
humans. Although the enzymatic activity of this complex has long been known, the 
enzymatically active member has not yet been identified. Using extensive sequence 
similarity searches, it could be shown that Ysh1 is homologous to a class of bacterial 
beta-lactamases. The active center of this protein family contains two zinc ions which are 
bound by histidines. As these residues are conserved in Ysh1, and as it was shown that the 
enzymatic activity of the yeast complex is zinc dependent, it is predicted that Ysh1 is 
responsible for the catalytic activity of the complex. Two other proteins found in the 
complex, Cft2 and YOR179c, are homologous to the Ysh1 N- and C-terminus, 
respectively. Though Cft2 is homologous to the enzymatic region of Ysh1, it lacks the 
zinc-binding histidines, indicating that it lacks enzymatic activity. Thus, Cft2 and 
YOR179c could compete with Ysh1 for the same binding slot of the complex, suggesting 
a novel type of regulation of polyadenylation. A similar mode of regulation might be used in 
the case of Pfs2 and YKL018w, which both consist of multiple WD40 domains. 

PREDICTION OF MAMMALIAN PROTEINS 

To allow the transfer of functional information from yeast to human proteins, we used not 
only an identity cutoff but also the 'orthology' concept. Orthology defines genes 
which arose via a speciation event, in contrast to genes which arose via gene 
duplication. Orthologous genes are supposed to perform the same function in different 
organisms; therefore, more detailed functional information can be transferred. The 
algorithm for the detection of orthologous gene pairs from yeast and human uses the 
whole genomes of these organisms. First, pairwise best hits were retrieved, using a full 
Smith-Waterman alignment of predicted proteins. To further improve reliability, these 
pairs were clustered with pairwise best hits involving Drosophila melanogaster and 
Caenorhabditis elegans proteins. See "Initial sequencing and analysis of the human 
genome", Nature 2001 Feb 15; 409(6822):860-921 for a detailed description of the 
analysis. 
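
The pairwise best-hit step described above can be sketched as follows. This is a minimal illustration only: it assumes that alignment scores (for example from Smith-Waterman alignments) have already been computed, and it interprets "pairwise best hits" as reciprocal best hits, which is an assumption. The function names, the human protein labels "hsA" and "hsB" and all scores are hypothetical. The subsequent clustering with Drosophila melanogaster and Caenorhabditis elegans best hits is not shown.

```python
def best_hits(scores):
    """Map each query to its highest-scoring subject.

    scores: dict {(query, subject): alignment_score}
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(yeast_vs_human, human_vs_yeast):
    """Return sorted (yeast, human) pairs that are each other's best hit."""
    y2h = best_hits(yeast_vs_human)
    h2y = best_hits(human_vs_yeast)
    return sorted((y, h) for y, h in y2h.items() if h2y.get(h) == y)


# Toy scores (hypothetical values, for illustration only)
yeast_vs_human = {
    ("Ysh1", "hsA"): 900, ("Ysh1", "hsB"): 300,
    ("Cft2", "hsB"): 700, ("Cft2", "hsA"): 280,
}
human_vs_yeast = {
    ("hsA", "Ysh1"): 900, ("hsA", "Cft2"): 280,
    ("hsB", "Cft2"): 700, ("hsB", "Ysh1"): 300,
}
pairs = reciprocal_best_hits(yeast_vs_human, human_vs_yeast)
print(pairs)
```

Requiring the hit to be reciprocal is what separates a candidate orthologue from a merely similar paralogue: in the toy data, Ysh1 also aligns to "hsB", but only the mutually best pairs are reported.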

Bioinformatic analysis of the Complex: 

Functional domains of all members of the complex were analyzed using SMART 
(SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids 
Res 2000 Jan 1; 28(1):231-4) and Pfam (Pfam: protein families database, Nucleic Acids 
Res 2000 Jan 1; 28(1):263-6). 

COMPARISON OF THE YEAST AND MAMMALIAN CLEAVAGE/POLYADENYLATION 
MACHINERY 

The sequences of many of the polypeptides involved in 3'-end formation are conserved 
from yeast to mammals, although the sequence elements on the substrate pre-mRNA 
differ (see Figure 1). 



The detailed experimental protocols for the example stated herein are given below: 

PROTOCOLS: 

ISOLATION OF PROTEIN COMPLEXES: 
a) ISOLATION OF COMPLEXES FROM YEAST: 

Yeast strain construction: 

Yeast strains expressing TAP-tagged ORFs were constructed in a semi-automated way 
essentially according to Rigaut et al. (Rigaut, G. et al. Nat Biotechnol 17, 1030-2 
(1999)) and Puig et al. (Puig, O. et al. Methods 24, 218-29 (2001)) (see also Fig. 2 and 
Table 5). 

TAP-purification using the Pta1-tagged strain: 

The Pta1-tagged strain was cultured in 4 l of YPD medium to an OD600 of 2. 

After harvesting, the cell pellet was frozen in liquid nitrogen and stored at 
-80°C. All further manipulations were done at 4°C except where noted. For preparation of 
protein lysates, the cells were resuspended in lysis buffer (50 mM Tris-HCl pH 7.5, 100 
mM NaCl, 0.15 % NP-40, 1.5 mM MgCl2, 0.5 mM DTT, protease inhibitors) and 
subjected to mechanical disruption with glass beads. Lysates were clarified by two 
successive centrifugation steps at 20,000 x g for 10 min and 100,000 x g for 1 hour. After 
addition of glycerol to 5 % final concentration, the lysates were frozen in liquid nitrogen 
and stored at -80°C. 

For the first purification step, 500 µl of rabbit IgG-Agarose (50:50 slurry, Sigma A2909) 
pre-equilibrated in lysis buffer were added to the lysate and the sample was rotated for 2 
hours. The unbound fraction was discarded and the beads with the bound material were 
transferred to a 0.8 ml column (MoBiTec M1002, 90 µm filter). The beads were washed 
with 10 ml of lysis buffer followed by 5 ml of TEV cleavage buffer (10 mM Tris-HCl pH 
8.0, 100 mM NaCl, 0.1 % NP-40, 0.5 mM EDTA, 1 mM DTT). 

150 µl of TEV cleavage buffer and 4 µl of TEV protease were added to the column and 
the sample was incubated on a shaker at 16 °C for 2 hours. The eluate was recovered by 
pressing with a syringe. 

150 µl of Calmodulin dilution buffer (10 mM Tris-HCl pH 8.0, 100 mM NaCl, 0.1 % NP-40, 
2 mM MgAc, 2 mM imidazole, 4 mM CaCl2, 1 mM DTT) was added to the previous 
eluate and this mixture was transferred to a MoBiTec column containing 300 µl (bead 
volume) of Calmodulin affinity resin (Stratagene #214303) which had been prewashed in 
Calmodulin wash buffer (10 mM Tris-HCl pH 8.0, 100 mM NaCl, 0.1 % NP-40, 1 mM 
MgAc, 1 mM imidazole, 2 mM CaCl2, 1 mM DTT). The samples were rotated for 1 hour 
at 4 °C. 

After washing of the beads with 10 ml of Calmodulin wash buffer, protein complexes 
were eluted with 600 µl of elution buffer (10 mM Tris-HCl pH 8.0, 5 mM EDTA). The 
samples were concentrated in siliconised tubes in a speed vac to a final volume of 10-20 
µl. Proteins were detected by polyacrylamide gel electrophoresis followed by staining 
with colloidal Coomassie blue. 
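
The arithmetic behind the Calmodulin dilution step above can be checked with a short calculation. The sketch below assumes that the TEV eluate has the composition of the TEV cleavage buffer and that the 4 µl of TEV protease can be neglected; the function name `mix` and the dictionary layout are illustrative choices, not part of the protocol.

```python
def mix(solution_a, volume_a, solution_b, volume_b):
    """Final concentration of every component after combining two solutions.

    Concentrations may be in any unit, as long as each component uses the
    same unit in both solutions; volumes must share one unit.
    """
    total = volume_a + volume_b
    names = sorted(set(solution_a) | set(solution_b))
    return {
        name: (solution_a.get(name, 0.0) * volume_a
               + solution_b.get(name, 0.0) * volume_b) / total
        for name in names
    }


# Concentrations in mM, taken from the buffer recipes in the text.
# Assumption: the eluate is treated as plain TEV cleavage buffer.
tev_eluate = {"Tris": 10, "NaCl": 100, "EDTA": 0.5, "DTT": 1}
dilution_buffer = {"Tris": 10, "NaCl": 100, "MgAc": 2,
                   "imidazole": 2, "CaCl2": 4, "DTT": 1}

final = mix(tev_eluate, 150, dilution_buffer, 150)  # volumes in µl
for name, conc in final.items():
    print(f"{name}: {conc:g} mM")
```

Under these assumptions the mixture ends up at 2 mM CaCl2, 1 mM MgAc and 1 mM imidazole, which is the composition of the Calmodulin wash buffer, with 0.25 mM EDTA carried over from the TEV cleavage buffer.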



General TAP-purification protocol for soluble proteins: 

TAP-purification of soluble proteins: 

The purification was done from 2 liters of yeast cells grown to late log phase 
(OD600 ~3-4). Cells were harvested and the pellet was frozen in liquid nitrogen and 
stored at -80 °C. All steps were done at 4°C. For preparation of protein lysates, the cells 
were resuspended in buffer A (50 mM Tris-HCl pH 7.5, 100 mM NaCl, 0.15 % NP-40, 1.5 
mM MgCl2, 0.5 mM DTT, protease inhibitors) and subjected to mechanical disruption with 
glass beads. Lysates were clarified by two successive centrifugation steps at 20,000 x g 
for 10 min and 100,000 x g for 1 hour. After addition of glycerol to 5 % final concentration, 
the lysates were frozen in liquid nitrogen and stored at -80°C. 

For the first purification step, 500 µl of rabbit IgG-Agarose (50:50 slurry, Sigma A2909) 
pre-equilibrated in buffer A were added to the lysate and the sample was rotated for 1 
hour. The unbound fraction was discarded and the beads with the bound material were 
transferred to a 0.8 ml column (MoBiTec M1002, 90 µm filter). The beads were washed 
with 10 ml of buffer A. 

150 µl of buffer A and 4 µl of TEV protease (1 mg/ml) were added to the column and the 
sample was incubated on a shaker at 16°C for 1 hour. The eluate was recovered by 
pressing with a syringe. 

150 µl of buffer A containing 4 mM CaCl2 was added to the previous eluate and this 
mixture was transferred to a MoBiTec column containing 300 µl (bead volume) of 
Calmodulin affinity resin (Stratagene #214303) which had been prewashed in buffer A 
containing 2 mM CaCl2. The samples were rotated for 1 hour at 4°C. 

After washing of the beads with 5 ml of buffer A containing 2 mM CaCl2, protein 
complexes were eluted with 600 µl of elution buffer (10 mM Tris-HCl pH 8.0, 5 mM 
EGTA). The samples were concentrated in siliconized tubes in a speed vac. Proteins 
were detected by polyacrylamide gel electrophoresis followed by staining with colloidal 
Coomassie blue. 

TAP-purification of membrane proteins: 

The purification was done from 2 liters of yeast cells grown to late log phase (OD600 
~3-4). Cells were harvested and the pellet was frozen in liquid nitrogen and stored at 
-80 °C. All steps were done at 4°C. For the purification of TAP-tagged membrane proteins, 
cells were lysed in buffer containing 50 mM Hepes/KOH pH 7.5, 150 mM KCl, 0.25 % 
NP-40, 2 mM MgCl2, 2 mM EDTA, 0.5 mM DTT and protease inhibitors. The extracts 
were spun at 20,000 x g for 10 min and the resulting supernatant was adjusted to 1.5 % 
NP-40 and 5 % glycerol. Samples were incubated for 30 min with end-over-end shaking 
and then centrifuged at 180,000 x g for 30 min. The resulting supernatant was 
immediately used for TAP-purification. 

For the first purification step, 500 µl of rabbit IgG-Agarose (50:50 slurry, Sigma A2909) 
pre-equilibrated in buffer B (50 mM Tris-HCl pH 7.5, 100 mM NaCl, 0.5 % NP-40, 1.5 mM 
MgCl2, 0.5 mM DTT, protease inhibitors) were added to the lysate and the sample was 
rotated for 1 hour. The unbound fraction was discarded and the beads with the bound 
material were transferred to a 0.8 ml column (MoBiTec M1002, 90 µm filter). The beads 
were washed with 10 ml of buffer B. 

150 µl of buffer B and 8 µl of TEV protease (1 mg/ml) were added to the column and the 
sample was incubated on a shaker at 16°C for 1 hour. The eluate was recovered by 
pressing with a syringe. 

150 µl of buffer B containing 4 mM CaCl2 was added to the previous eluate and this 
mixture was transferred to a MoBiTec column containing 300 µl (bead volume) of 
Calmodulin affinity resin (Stratagene #214303) which had been prewashed in buffer B 
containing 2 mM CaCl2. The samples were rotated for 1 hour at 4°C. 

After washing of the beads with 5 ml of buffer B containing 2 mM CaCl2, protein 
complexes were eluted with 600 µl of elution buffer (10 mM Tris-HCl pH 8.0, 5 mM 
EGTA). The samples were concentrated in siliconized tubes in a speed vac. Proteins 
were detected by polyacrylamide gel electrophoresis followed by staining with colloidal 
Coomassie blue. 



b) ISOLATION OF COMPLEXES FROM MAMMALIAN CELLS 

Cells: 

Retroviral transduction vectors containing the TAP cassette were generated by 
directional cloning of PCR-amplified ORFs into a modified version of a MoMLV-based 
vector via the Gateway site-specific recombination system (Life Technologies). Virus 
stocks were generated in a HEK293 gag-pol packaging cell line by pseudotyping with 
VSV-G. Cells were infected and complexes were purified after cell expansion and 
cultivation of 3-5 using a modified TAP protocol. 

Standard lysis protocol: 

The medium was removed from the culture dish and the cells were scraped directly from 
the plate with the help of a rubber policeman. The cells were collected on ice, washed 3 times 
with PBS and resuspended in lysis buffer (50 mM Tris, pH 7.5; 5 % glycerol; 0.2 % 
IGEPAL; 1.5 mM MgCl2; 1 mM DTT; 100 mM NaCl; 50 mM NaF; 1 mM Na3VO4 + 
protease inhibitors). The cells were lysed for 30 min on ice, spun for 10 min at 20,000 g 
and re-spun for 1 h at 100,000 g. The supernatant was recovered, rapidly frozen in liquid 
nitrogen and stored at -80 °C. For pre-clearing, the thawed lysate was incubated with 
500 µl sepharose CL-4B beads (Amersham Pharmacia) for 1 h with shaking and finally 
processed according to the TAP protocol. 

Nuclear lysis protocol: 

The medium was removed from the culture dish and the cells were scraped directly from 
the plate with the help of a rubber policeman. The cells were collected on ice, washed 3 times 
with PBS and resuspended in buffer A (10 mM Tris-Cl, pH 7.5; 1.5 mM MgCl2; 10 mM 
KCl; 1 mM DTT; 50 mM NaF; 1 mM Na3VO4). To isolate the nuclei, the lysate was dounced 
with a tight-fitting pestle in a dounce homogenizer for 15 strokes. The nuclei were 
harvested by centrifugation (10 min at 2000 g and 20 min at 16,000 g) and lysed in 
buffer B (50 mM Tris-Cl, pH 7.5; 1.5 mM MgCl2; 20 % glycerol; 420 mM NaCl; 1 mM DTT; 
50 mM NaF; 1 mM Na3VO4) for 30 min on ice with frequent shaking. The protein lysate 
was cleared by centrifugation (30 min at 100,000 g) and diluted 1:4 with buffer C (50 
mM Tris-Cl, pH 7.5; 1 mM DTT; 0.26 % NP40; 1.5 mM MgCl2; 50 mM NaF; 1 mM 
Na3VO4). After 30 min incubation on ice, the lysate was re-spun for 30 min at 100,000 g, 
quickly frozen in liquid nitrogen and stored at -80 °C. For pre-clearing, the thawed lysate 
was incubated with 500 µl sepharose CL-4B beads (Amersham Pharmacia) for 1 h with 
shaking and finally processed according to the TAP protocol. 
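
The effect of the 1:4 dilution of the nuclear lysate can be worked through numerically. The text does not say whether "1:4" means one part lysate in four parts total or one part lysate plus four parts buffer C, so the sketch below computes both readings. It assumes the cleared nuclear lysate has the composition of buffer B and tracks only the components that differ between buffers B and C; the function name `dilute` and the key names are illustrative.

```python
def dilute(stock, diluent, stock_parts, diluent_parts):
    """Concentrations after mixing stock and diluent in a given volume ratio."""
    total = stock_parts + diluent_parts
    names = sorted(set(stock) | set(diluent))
    return {
        name: (stock.get(name, 0.0) * stock_parts
               + diluent.get(name, 0.0) * diluent_parts) / total
        for name in names
    }


# Assumption: the lysate is treated as plain buffer B.
buffer_b = {"glycerol_pct": 20.0, "NaCl_mM": 420.0}
buffer_c = {"NP40_pct": 0.26}

one_in_four = dilute(buffer_b, buffer_c, 1, 3)    # 1 part lysate + 3 parts buffer C
one_plus_four = dilute(buffer_b, buffer_c, 1, 4)  # 1 part lysate + 4 parts buffer C

for label, result in (("1 in 4", one_in_four), ("1 + 4", one_plus_four)):
    print(label, {name: round(value, 3) for name, value in result.items()})
```

Under the first reading the diluted lysate comes out at 5 % glycerol, 105 mM NaCl and about 0.2 % detergent, which is close to the standard lysis buffer given above (5 % glycerol, 100 mM NaCl, 0.2 % IGEPAL). The second reading gives 4 % glycerol and 84 mM NaCl. Which of the two was intended is not stated.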



MASS SPECTROMETRIC ANALYSIS 

Protein digestion prior to mass spectrometric analysis: 

Gel-separated proteins were reduced, alkylated and digested in gel essentially following 
the procedure described by Shevchenko et al. (Shevchenko, A., Wilm, M., Vorm, O., 
Mann, M. Anal Chem 1996, 68, 850-858). Briefly, gel-separated proteins were excised 
from the gel using a clean scalpel, reduced using 10 mM DTT (in 5 mM ammonium 
bicarbonate, 54 °C, 45 min) and subsequently alkylated with 55 mM iodoacetamide (in 5 
mM ammonium bicarbonate) at room temperature in the dark (30 min). Reduced and 
alkylated proteins were digested in gel with porcine trypsin (Promega) at a protease 
concentration of 12.5 ng/µl in 5 mM ammonium bicarbonate. Digestion was allowed to 
proceed for 4 hours at 37 °C and the reaction was subsequently stopped using 2 µl of 25% 
TFA. 

Desalting and concentration of peptides produced by in-gel digestion of gel-separated 
proteins: 

Peptides were desalted and concentrated using a prefabricated µZipTip (Millipore) 
reversed phase column. Peptides were eluted directly onto stainless steel MS sample 
holders using 2 µl of eluent (70% acetonitrile in 5% TFA containing 2 mg/ml alpha-cyano-4- 
hydroxy-cinnamic acid and two standard peptides for internal calibration of mass 
spectra). 



Mass spectrometric data acquisition: 

Matrix-assisted laser desorption/ionisation (MALDI) time-of-flight (TOF) mass spectra 
were acquired in delayed extraction mode on a Voyager DE-STR PRO MALDI mass 
spectrometer (Applied Biosystems) equipped with a 337 nm nitrogen laser. 500 laser 
shots were averaged in order to produce final spectra. Spectra were automatically 
internally calibrated using two standard peptides. The monoisotopic masses for all 
peptide ion signals detected in the acquired spectra were determined and used for 
database searching. 

Protein sequence database searching using peptide mass fingerprinting (PMF) data: 

The list of monoisotopic peptide masses obtained from the MALDI mass spectrum was 
used to query a FASTA-formatted protein sequence database that contained all protein 
sequences from S. cerevisiae. Proteins were identified by peptide mass fingerprinting 
(Mann, M., Hojrup, P., Roepstorff, P. Biol Mass Spectrom 1993, 22, 338-345; Pappin, 
D., Hojrup, P., Bleasby, A. J. Curr. Biol. 1993, 3, 327-33; Henzel, W. J., Billeci, T. M., 
Stults, J. T., Wong, S. C., Grimley, C., Watanabe, C. Proc Natl Acad Sci U S A 1993, 90, 
5011-5015; Yates, J. R., Speicher, S., Griffin, P. R., Hunkapiller, T. Anal Biochem 1993, 
214, 397-408; James, P., Quadroni, M., Carafoli, E., Gonnet, G. Biochem Biophys Res 
Commun 1993, 195, 58-64) using the software tool ProFound (Proteometrics). In PMF, a 
protein is identified by correlating the measured peptide masses with theoretical digests 
of all proteins present in the database. Search criteria included: tryptic protein cleavage, 
monoisotopic masses, 30 ppm mass accuracy. No restrictions on protein size or 
isoelectric point were imposed. 
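
The principle of peptide mass fingerprinting described above can be illustrated with a minimal sketch. It is not the ProFound algorithm: it simply counts how many observed masses fall within 30 ppm of a theoretical tryptic peptide. The sketch assumes cleavage after K or R except before P, no missed cleavages, singly protonated ions and fully carbamidomethylated cysteine (consistent with the iodoacetamide alkylation step). The two database sequences and the simulated spectrum are hypothetical.

```python
import re

# Monoisotopic residue masses in Da. Cysteine is listed in its
# carbamidomethylated form (103.00919 + 57.02146), assuming complete
# alkylation with iodoacetamide.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 160.03065, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728


def tryptic_peptides(sequence):
    """Cleave after K or R unless followed by P; no missed cleavages."""
    return [p for p in re.split(r"(?<=[KR])(?!P)", sequence) if p]


def mh_mass(peptide):
    """Monoisotopic mass of the singly protonated peptide, [M+H]+."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


def count_matches(observed, sequence, ppm=30.0):
    """Number of observed masses within `ppm` of any theoretical peptide."""
    theoretical = [mh_mass(p) for p in tryptic_peptides(sequence)]
    return sum(
        any(abs(m - t) / t * 1e6 <= ppm for t in theoretical)
        for m in observed
    )


def rank_proteins(observed, database, ppm=30.0):
    """Database entries as (matches, name), sorted best first."""
    scored = [(count_matches(observed, seq, ppm), name)
              for name, seq in database.items()]
    return sorted(scored, key=lambda item: (-item[0], item[1]))


# Hypothetical two-entry database and a simulated spectrum of "protA"
# in which every mass is shifted by 10 ppm.
database = {
    "protA": "GASPVTKLLNDQERMHFYWK",
    "protB": "AAGGSSKTTVVLLRPEPTIDEK",
}
observed = [mh_mass(p) * (1 + 10e-6) for p in tryptic_peptides(database["protA"])]
ranking = rank_proteins(observed, database)
print(ranking)
```

A real search tool scores matches statistically rather than by a simple count, and usually allows missed cleavages and variable modifications; the count is used here only to keep the correlation step visible.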

BIOINFORMATICS 

Functional and localization information about yeast proteins was retrieved from the Yeast 
Protein Database (YPD (Costanzo, M.C. et al., 2001, Nucl. Acids Res. 29: 75-9; Hodges, 
P.E. et al., 1999, Nucl. Acids Res. 27: 69-73)) released in August 2001. In order to get a 
more concise classification for localization and function, YPD classes were merged. 
Protein domain analysis was performed using SMART (Schultz, J., Copley, R. R., 
Doerks, T., Ponting, C. P. & Bork, P. SMART, Nucleic Acids Res 28, 231-4 (2000)). 
PSI-BLAST (Altschul, S. F. et al. Gapped BLAST and PSI-BLAST, Nucleic Acids Res 25, 
3389-402 (1997)) was used for homology analysis. All additional analysis software has 
been developed in house, using Perl and Python. 



ASSAYS FOR ASSAYING THE ACTIVITIES OF THE COMPLEXES PRESENTED IN 
THE INVENTION 



An exemplary RNA binding assay can be carried out by contacting a complex having 
RNA binding activity with a radioactive [32P] end-labeled RNA substrate, e.g. a poly(A) 
RNA, under appropriate conditions and detecting bound protein. The detection of bound 
protein can be carried out, e.g., by filtering the solution through a nitrocellulose filter and 
determining the radioactivity bound to the filter. This assay is based on the retention of 
nucleic acid-protein complexes on nitrocellulose, whereas free nucleic acid can pass 
through the filter 
(see e.g. Wahle, E., 1991, Methods 66: 759-68) 
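
The read-out of such a filter-binding assay is usually expressed as the fraction of labeled RNA retained on the filter. The following sketch shows one way to compute it; the background correction using a no-protein control, the function name and all counts are assumptions made for illustration and are not taken from the cited method.

```python
def fraction_bound(filter_cpm, background_cpm, input_cpm):
    """Background-corrected fraction of labeled RNA retained on the filter."""
    if input_cpm <= 0:
        raise ValueError("input_cpm must be positive")
    return max(filter_cpm - background_cpm, 0.0) / input_cpm


# Hypothetical titration: complex concentration (nM) -> counts on the filter
titration = {0: 120, 5: 1620, 10: 3120, 20: 5120}
background = titration[0]   # no-protein control
input_cpm = 10000           # total counts of RNA added per reaction

bound = {conc: fraction_bound(cpm, background, input_cpm)
         for conc, cpm in titration.items()}
for conc, fraction in bound.items():
    print(f"{conc} nM: {fraction:.2f}")
```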

An exemplary RNA exonuclease assay can be carried out by contacting a complex 
having RNA exonuclease activity with a radioactive [32P] end-labeled RNA 
substrate under appropriate conditions and detecting the release of free radioactive 
nucleotides. The detection of free radioactive nucleotides can be carried out, e.g., by 
adding 20% trichloroacetic acid, filtering the solution through a filter and measuring the 
amount of acid-soluble radioactivity 
(see e.g. Ross, J., 1999, Methods 17: 52-9) 

An exemplary mRNA splicing assay can be carried out by contacting a complex having 
mRNA splicing activity with a radioactively labeled RNA substrate under appropriate 
conditions and detecting the release of spliced RNA species. The detection of spliced 
RNA species can be carried out, e.g., by fractionation of processed RNAs in a glycerol 
gradient and subsequent analysis by denaturing polyacrylamide gel electrophoresis and 
visualization by autoradiography 
(see e.g. Schwer, B. and Gross, C. H., 1998, Methods 17: 2086-94) 

An exemplary rRNA processing assay can be carried out by contacting a complex having 
rRNA processing activity with a pre-rRNA substrate under appropriate conditions and 
detecting the release of free processed rRNA species. The detection of processed rRNA 
species can be carried out, e.g., using a primer extension or northern blotting assay by 
measuring the size of the rRNA species 

(see e.g. Kressler, D. et al, 1997, Methods 17: 7283-94) 
INDIVIDUAL YEAST PROTEINS OF THE COMPLEXES

PFK1 | Phosphofructokinase alpha subunit, part of a complex with Pfk2p which catalyzes ATP-dependent conversion of fructose-6-phosphate to fructose-1,6-bisphosphate, a key regulatory step in glycolysis | Other kinase; Transferase | Carbohydrate metabolism

 | reticulum to Golgi transport, contains six WD (WD-40) repeats | | Vesicular transport

SEC31 | Component (p150) of the COPII coat of secretory pathway vesicles involved in endoplasmic reticulum to Golgi transport, associated with Sec13p, member of WD (WD-40) repeat family | Vesicle coat protein | Vesicular transport

 | requirement for glycolytic gene expression | |

 | Protein required for vacuolar import and degradation of Fbp1p (fructose-1,6-bisphosphatase) | Unknown | Protein degradation; Vesicular transport

 | Subunit of the Vps52p-Vps53p-Vps54p complex, involved in protein | Docking protein | Vesicular transport
TABLE 3

MEDICAL APPLICATION OF THE COMPLEX

Name of complex | Cellular role | Medical applications
Polyadenylation-complex | RNA processing/modification | infectious diseases; viral infections such as herpes simplex infections, Epstein-Barr infections, influenza; metabolic disease such as metachromatic leukodystrophy; neurodegenerative disorders such as amyotrophic lateral sclerosis; cancer
TABLE 4

CHARACTERIZATION OF PREVIOUSLY UNDESCRIBED PROTEINS

List of proteins of unknown functions | Putative function | Motifs found by sequence analysis | Predicted known orthologues
YCL046W | - | |
[illegible] | | RRM |
YHL035C | multispecific organic anion transporter 1, multidrug resistance-associated protein 2) | 3xTM, 2x PFAM: ABC-membrane domain, 2x AAA | no ortholog, but high identity to MRP1-HUMAN (P33527), MRP3-HUMAN (O15438), MRP2-HUMAN (Q92887)
[illegible] | guanine nucleotide binding protein | 4x WD40 |
[illegible] | - | |
[illegible] | | 2x transmembrane | Q9P297 (CLST 11240 protein)
[illegible] | polyadenylation | - |

Legend

AAA: ATPases associated with a variety of cellular activities









WO 02/092626 



119 



PCT/EP02/05359 



"missing upon time of publication" 
oligos: purchased from MWG; forward and reverse primers are pre-mixed to a final
concentration of 10 micromolar and delivered in a 96 well plate format

step | volume | temperature | device
prepare master mix (AmpliTaq, Perkin Elmer), containing the TAP cassette vector (1.5 ng/25 microliter reaction) | up to 2.5 ml | 4C master mix | cooling platform 1
dispense the master mix in 96 well plate = reaction plate (96 well plate) | 23.5 microliters | 4C master mix + reaction plate | cooling platforms 1+2
add 1.0 microliters of oligo (final concentration: 0.4 microM) to reaction plate | 1.0 microliters | 4C reaction plate | cooling platform 2
cycle (30 cycles) the PCR reaction | | | MJ thermocyclers
pipette 2 microliters from the reaction plate to a 96 well gel (Pharmacia) | 2 microliters | | Pharmacia ready to run gels
run gel 5 minutes | | | Pharmacia, Ready-To-Run
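
The volumes and concentrations in the PCR set-up can be cross-checked with a simple dilution calculation. This is a sketch only: the helper name is hypothetical, and the nominal 25 microliter reaction volume is an assumption taken from the "1.5 ng/25 microliter reaction" figure (the listed additions sum to 24.5 microliters).

```python
def final_concentration(stock, added_volume, total_volume):
    """C_final = C_stock * V_added / V_total (units carried through unchanged)."""
    if total_volume <= 0:
        raise ValueError("total_volume must be positive")
    return stock * added_volume / total_volume


WELLS = 96
MASTER_MIX_PER_WELL_UL = 23.5   # microliters dispensed per well
OLIGO_PER_WELL_UL = 1.0         # microliters of pre-mixed primers per well
OLIGO_STOCK_UM = 10.0           # micromolar stock as delivered
NOMINAL_REACTION_UL = 25.0      # assumption: nominal reaction volume

oligo_final_um = final_concentration(
    OLIGO_STOCK_UM, OLIGO_PER_WELL_UL, NOMINAL_REACTION_UL
)
master_mix_needed_ul = WELLS * MASTER_MIX_PER_WELL_UL

print(f"final oligo concentration: {oligo_final_um:.2f} microM")
print(f"master mix for one 96 well plate: {master_mix_needed_ul:.0f} microliters")
```

Under these assumptions the calculation reproduces the stated final oligo concentration of 0.4 microM, and the master mix needed for one plate stays within the "up to 2.5 ml" that is prepared.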



3) Yeast transformation 

General considerations: Procedure partially automated 

Materials: the haploid yeast strain is MGD453-13D: MATa, ade2, arg4, leu2-3,112,
trp1-289, ura3-52.



step | volume | temperature | device | "novelties" versus original protocol
the day before, inoculate 200 ml of YPD with ~100 microliters of freshly growing yeast culture | 200 ml | | shaking incubator |
when the culture reaches an OD600 of 1.0, dispense 2 ml of yeast culture in 24 well plate = transformation plate (4) | | | | uses of 24 well plates (Qiagen)
[illegible]
aspirate the sup
resuspend cells in the remaining media
dispense 5 microliters of carrier DNA (herring testes carrier DNA; 10 mg/ml; Clontech) | 5 microliters | rt
transfer remainder of PCR from reaction plate to the transformation plate
add 500 microliters PEG/LiAc
add 55 microliters DMSO, mix
incubate 15' rt
[illegible]
incubate 15' at 42C | | | elements
transfer at rt + 700 microliters TE | 700 microliters | rt
spin | | rt
aspirate sup | | rt
add 800 microliters YPD | | rt
incubate 30' at 30C + shake | | 30C | heating platform + shaker
add 1 ml of TE | | rt
spin | | rt
aspirate sup | | rt
add 50 microliters TE | | rt | 8 tips
mix | | rt | shaker
plate + incubate for ~3 days at 30C (10 cm Petri dishes or 12-24 well plates)











4) Check PCR

General considerations: fully automated. Depending on the results of the transformation, 0 to 6
colonies are tested for homologous recombination. These results are filed in an Excel file
directly linked to the robot program.

Material: the forward oligos are specific for each ORF (for the sequence cf. 1); purchased
from MWG at 10 micromolar in 96 well plates. The reverse oligo is constant for all ORFs
and anneals in the TAP sequence.



step | volume | temperature | device
dispense 20 microliters NaOH (20 mM) in 96 well plates | 20 microliters | rt
pick up colonies (0 to 6) in the 96 well plate containing the NaOH = DNA plate (96 well)
boil 2'
add 140 microliters of TE pH 7.5 (8x dilution)
spin at 2000 rpm
prepare master mix (PCR master, Roche)
dispense 14.5 microliters in reaction plate (96 well plate) | 14.5 microliters
add 2.5 microliters of oligo (final concentration 1 microM) to reaction plate | 2.5 microliters
transfer 8 microliters DNA from DNA plate to reaction plate | 8 microliters
cycle (30 cycles) the PCR reaction | | | MJ thermocyclers
add 1 microliter of LB | 1 microliter
load 10 microliters on a 96 well agarose gel | 10 microliters | | Pharmacia ready to run gels
run gel 5 minutes | | | Pharmacia, Ready-To-Run

devices for reaction set-up: cooling platform 1, cooling platform 2
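
The figures quoted for the check PCR are internally consistent, which a short calculation illustrates. The variable names are illustrative, and the reaction volume is assumed to be the sum of the three listed additions.

```python
# Colony lysate: 20 microliters NaOH plus 140 microliters TE.
NAOH_UL = 20.0
TE_UL = 140.0

# Reaction assembly per well.
MASTER_MIX_UL = 14.5
OLIGO_UL = 2.5
DNA_UL = 8.0
OLIGO_STOCK_UM = 10.0  # micromolar, as delivered by the supplier

# Dilution of the lysate after adding TE.
dilution_factor = (NAOH_UL + TE_UL) / NAOH_UL

# Assumption: total volume is the sum of master mix, oligo and DNA.
reaction_volume_ul = MASTER_MIX_UL + OLIGO_UL + DNA_UL

# Final primer concentration in the assembled reaction.
oligo_final_um = OLIGO_STOCK_UM * OLIGO_UL / reaction_volume_ul

print(f"lysate dilution: {dilution_factor:.0f}x")
print(f"reaction volume: {reaction_volume_ul:.1f} microliters")
print(f"final oligo concentration: {oligo_final_um:.1f} microM")
```

This agrees with the "8x dilution" and the "final concentration 1 microM" stated in the steps.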



5) Dot blot analysis

the remainder of the check PCR positive colonies are restreaked on -ura plates; plates are incubated at 30C | 30C incubator
the next day, the restreak is used to inoculate 2 ml of YPD in 24 well plates | 30C shaker
TABLE 6

KNOWN COMPONENTS OF THE YEAST mRNA 3’-END PROCESSING MACHINERY

Factor | Function | Polypeptide comp. (kDa) | Gene product / (ORF) | Sequence motifs
CF IA | Cleavage and polyadenylation | 79.8 | Rna14 (YMR061W) | 8xHAT domains
 | | 71.9 | Pcf11 (YDR228C) |
 | | 50.0 | Clp1 (YOR250C) |
 | | 32.8 | Rna15 (YGL044C) | RRM
 | Polyadenylation | 64.2 | Pab1 (YER165W) | 4xRRM, 1xPolyA
CF IB | Cleavage site selection and polyadenylation | 73 | Hrp1 (YOL123W) |
CF II/PF I (= CPF) | Cleavage and polyadenylation | 153.4 | Cft1/Yhh1 (YDR301W) |
 | | 96.1 | Cft2/Ydh1 (YLR115W) | Lactamase
 | | 87.6 | Ysh1/Brr5 (YLR277c) |
 | | | | NTP_transfer_2
CF: cleavage factor

PF I: polyadenylation factor

CstF: cleavage stimulation factor

CPSF: cleavage and polyadenylation specificity factor

YGR156w: has RNA-binding domain

Glc7: was found in Y2H using Ref2 as bait (Uetz screen)

YOR179c: similar to Ysh1 (PF I complex) (37% identity, 56% similarity)
The present invention is not to be limited in scope by the specific embodiments
described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying figures. Such modifications are intended to fall within the 
scope of the appended claims. 

Various publications are cited herein, the disclosures of which are incorporated by 
reference in their entireties. 
CLAIMS

1. An isolated complex selected from complex (I) and comprising

(a) a first protein, or a functionally active fragment or functionally active derivative
thereof, which first protein is selected from the group consisting of:

(i) Cft1 (SEQ ID NO:3), or a mammalian homolog thereof, or a variant of Cft1
encoded by a nucleic acid that hybridizes to the Cft1 nucleic acid (SEQ ID NO:4) or its
complement under low stringency conditions,

(ii) Cft2 (SEQ ID NO:5), or a mammalian homolog thereof, or a variant of Cft2 
encoded by a nucleic acid that hybridizes to the Cft2 nucleic acid (SEQ ID NO:6) or its 
complement under low stringency conditions, 

(iii) Clp1 (SEQ ID NO:9), or a mammalian homolog thereof, or a variant of Clp1
encoded by a nucleic acid that hybridizes to the Clp1 nucleic acid (SEQ ID NO:10) or its
complement under low stringency conditions,

(iv) Fip1 (SEQ ID NO:19), or a mammalian homolog thereof, or a variant of Fip1
encoded by a nucleic acid that hybridizes to the Fip1 nucleic acid (SEQ ID NO:20) or its
complement under low stringency conditions,

(v) Pab1 (SEQ ID NO:33), or a mammalian homolog thereof, or a variant of Pab1
encoded by a nucleic acid that hybridizes to the Pab1 nucleic acid (SEQ ID NO:34) or its
complement under low stringency conditions,

(vi) Pap1 (SEQ ID NO:35), or a mammalian homolog thereof, or a variant of Pap1
encoded by a nucleic acid that hybridizes to the Pap1 nucleic acid (SEQ ID NO:36) or its
complement under low stringency conditions,

(vii) Pcf11 (SEQ ID NO:37), or a mammalian homolog thereof, or a variant of
Pcf11 encoded by a nucleic acid that hybridizes to the Pcf11 nucleic acid (SEQ ID
NO:38) or its complement under low stringency conditions,

(viii) Pfs2 (SEQ ID NO:43), or a mammalian homolog thereof, or a variant of Pfs2 
encoded by a nucleic acid that hybridizes to the Pfs2 nucleic acid (SEQ ID NO:44) or its 
complement under low stringency conditions, 

(ix) Pta1 (SEQ ID NO:45), or a mammalian homolog thereof, or a variant of Pta1
encoded by a nucleic acid that hybridizes to the Pta1 nucleic acid (SEQ ID NO:46) or its
complement under low stringency conditions,

(x) Rna14 (SEQ ID NO:49), or a mammalian homolog thereof, or a variant of
Rna14 encoded by a nucleic acid that hybridizes to the Rna14 nucleic acid (SEQ ID
NO:50) or its complement under low stringency conditions,

(xi) Rna15 (SEQ ID NO:51), or a mammalian homolog thereof, or a variant of 
Rna15 encoded by a nucleic acid that hybridizes to the Rna15 nucleic acid (SEQ ID 
NO:52) or its complement under low stringency conditions, 

(xii) Tif4632 (SEQ ID NO:63), or a mammalian homolog thereof, or a variant of 
Tif4632 encoded by a nucleic acid that hybridizes to the Tif4632 nucleic acid (SEQ ID 
NO:64) or its complement under low stringency conditions, 

(xiii) Ykl059c (SEQ ID NO:89), or a mammalian homolog thereof, or a variant of 
Ykl059c encoded by a nucleic acid that hybridizes to the Ykl059c nucleic acid (SEQ ID 
NO:90) or its complement under low stringency conditions, 

(xiv) Ysh1 (SEQ ID NO:75), or a mammalian homolog thereof, or a variant of
Ysh1 encoded by a nucleic acid that hybridizes to the Ysh1 nucleic acid (SEQ ID NO:76)
or its complement under low stringency conditions, and

(xv) Yth1 (SEQ ID NO:77), or a mammalian homolog thereof, or a variant of Yth1
encoded by a nucleic acid that hybridizes to the Yth1 nucleic acid (SEQ ID NO:78) or its
complement under low stringency conditions; and

(b) a second protein, or a functionally active fragment or functionally active 
derivative thereof, which second protein is selected from the group consisting of: 

(i) Act1 (SEQ ID NO:1), or a mammalian homolog thereof, or a variant of Act1
encoded by a nucleic acid that hybridizes to the Act1 nucleic acid (SEQ ID NO:2) or its
complement under low stringency conditions,

(ii) Cka1 (SEQ ID NO:7), or a mammalian homolog thereof, or a variant of Cka1
encoded by a nucleic acid that hybridizes to the Cka1 nucleic acid (SEQ ID NO:8) or its
complement under low stringency conditions,

(iii) Eft2 (SEQ ID NO:11), or a mammalian homolog thereof, or a variant of Eft2
encoded by a nucleic acid that hybridizes to the Eft2 nucleic acid (SEQ ID NO:12) or its
complement under low stringency conditions,

(iv) Eno2 (SEQ ID NO:13), or a mammalian homolog thereof, or a variant of Eno2 
encoded by a nucleic acid that hybridizes to the Eno2 nucleic acid (SEQ ID NO:14) or its 
complement under low stringency conditions, 





(v) Glc7 (SEQ ID NO:15), or a mammalian homolog thereof, or a variant of Glc7
encoded by a nucleic acid that hybridizes to the Glc7 nucleic acid (SEQ ID NO:16) or its
complement under low stringency conditions,

(vi) Gpm1 (SEQ ID NO:17), or a mammalian homolog thereof, or a variant of
Gpm1 encoded by a nucleic acid that hybridizes to the Gpm1 nucleic acid (SEQ ID
NO:18) or its complement under low stringency conditions,

(vii) Hhf2 (SEQ ID NO:21), or a mammalian homolog thereof, or a variant of Hhf2 
encoded by a nucleic acid that hybridizes to the Hhf2 nucleic acid (SEQ ID NO:22) or its 
complement under low stringency conditions, 

(viii) Hta1 (SEQ ID NO:23), or a mammalian homolog thereof, or a variant of Hta1
encoded by a nucleic acid that hybridizes to the Hta1 nucleic acid (SEQ ID NO:24) or its
complement under low stringency conditions,

(ix) Hsc82 (SEQ ID NO:25), or a mammalian homolog thereof, or a variant of 
Hsc82 encoded by a nucleic acid that hybridizes to the Hsc82 nucleic acid (SEQ ID 
NO:26) or its complement under low stringency conditions, 

(x) Imd2 (SEQ ID NO:27), or a mammalian homolog thereof, or a variant of Imd2 
encoded by a nucleic acid that hybridizes to the Imd2 nucleic acid (SEQ ID NO:28) or its 
complement under low stringency conditions, 

(xi) Imd4 (SEQ ID NO:29), or a mammalian homolog thereof, or a variant of Imd4
encoded by a nucleic acid that hybridizes to the Imd4 nucleic acid (SEQ ID NO:30) or its
complement under low stringency conditions,

(xii) Met6 (SEQ ID NO:31), or a mammalian homolog thereof, or a variant of Met6 
encoded by a nucleic acid that hybridizes to the Met6 nucleic acid (SEQ ID NO:32) or its 
complement under low stringency conditions, 

(xiii) Pdc1 (SEQ ID NO:39), or a mammalian homolog thereof, or a variant of
Pdc1 encoded by a nucleic acid that hybridizes to the Pdc1 nucleic acid (SEQ ID NO:40)
or its complement under low stringency conditions,

(xiv) Pfk1 (SEQ ID NO:41), or a mammalian homolog thereof, or a variant of Pfk1
encoded by a nucleic acid that hybridizes to the Pfk1 nucleic acid (SEQ ID NO:42) or its
complement under low stringency conditions,

(xv) Ref2 (SEQ ID NO:47), or a mammalian homolog thereof, or a variant of Ref2 
encoded by a nucleic acid that hybridizes to the Ref2 nucleic acid (SEQ ID NO:48) or its 
complement under low stringency conditions, 
(xvi) Sec13 (SEQ ID NO:53), or a mammalian homolog thereof, or a variant of
Sec13 encoded by a nucleic acid that hybridizes to the Sec13 nucleic acid (SEQ ID
NO:54) or its complement under low stringency conditions,

(xvii) Sec31 (SEQ ID NO:55), or a mammalian homolog thereof, or a variant of 
Sec31 encoded by a nucleic acid that hybridizes to the Sec31 nucleic acid (SEQ ID 
NO:56) or its complement under low stringency conditions, 

(xviii) Ssa3 (SEQ ID NO:57), or a mammalian homolog thereof, or a variant of 
Ssa3 encoded by a nucleic acid that hybridizes to the Ssa3 nucleic acid (SEQ ID NO:58) 
or its complement under low stringency conditions, 

(xix) Ssu72 (SEQ ID NO:59), or a mammalian homolog thereof, or a variant of 
Ssu72 encoded by a nucleic acid that hybridizes to the Ssu72 nucleic acid (SEQ ID 
NO:60) or its complement under low stringency conditions, 

(xx) Taf60 (SEQ ID NO:61), or a mammalian homolog thereof, or a variant of
Taf60 encoded by a nucleic acid that hybridizes to the Taf60 nucleic acid (SEQ ID
NO:62) or its complement under low stringency conditions,

(xxi) Tkl1 (SEQ ID NO:65), or a mammalian homolog thereof, or a variant of Tkl1
encoded by a nucleic acid that hybridizes to the Tkl1 nucleic acid (SEQ ID NO:66) or its
complement under low stringency conditions,

(xxii) Tsa1 (SEQ ID NO:67), or a mammalian homolog thereof, or a variant of
Tsa1 encoded by a nucleic acid that hybridizes to the Tsa1 nucleic acid (SEQ ID NO:68)
or its complement under low stringency conditions,

(xxiii) Tye7 (SEQ ID NO:69), or a mammalian homolog thereof, or a variant of 
Tye7 encoded by a nucleic acid that hybridizes to the Tye7 nucleic acid (SEQ ID NO:70) 
or its complement under low stringency conditions, 

(xxiv) Vid24 (SEQ ID NO:71), or a mammalian homolog thereof, or a variant of 
Vid24 encoded by a nucleic acid that hybridizes to the Vid24 nucleic acid (SEQ ID 
NO:72) or its complement under low stringency conditions, 

(xxv) Vps53 (SEQ ID NO:73), or a mammalian homolog thereof, or a variant of 
Vps53 encoded by a nucleic acid that hybridizes to the Vps53 nucleic acid (SEQ ID 
NO:74) or its complement under low stringency conditions, 

(xxvi) Ycl046w (SEQ ID NO:79), or a mammalian homolog thereof, or a variant of
Ycl046w encoded by a nucleic acid that hybridizes to the Ycl046w nucleic acid (SEQ ID
NO:80) or its complement under low stringency conditions,

(xxvii) Ygr156w (SEQ ID NO:81), or a mammalian homolog thereof, or a variant of
Ygr156w encoded by a nucleic acid that hybridizes to the Ygr156w nucleic acid (SEQ ID 
NO:82) or its complement under low stringency conditions, 

(xxviii) Yhl035c (SEQ ID NO:83), or a mammalian homolog thereof, or a variant of 
Yhl035c encoded by a nucleic acid that hybridizes to the Yhl035c nucleic acid (SEQ ID 
NO:84) or its complement under low stringency conditions, 

(xxix) Ykl018w (SEQ ID NO:85), or a mammalian homolog thereof, or a variant of 
Ykl018w encoded by a nucleic acid that hybridizes to the Ykl018w nucleic acid (SEQ ID 
NO:86) or its complement under low stringency conditions, 

(xxx) Ylr221c (SEQ ID NO:87), or a mammalian homolog thereof, or a variant of
Ylr221c encoded by a nucleic acid that hybridizes to the Ylr221c nucleic acid (SEQ ID 
NO:88) or its complement under low stringency conditions, 

(xxxi) Yml030w (SEQ ID NO:91), or a mammalian homolog thereof, or a variant of 
Yml030w encoded by a nucleic acid that hybridizes to the Yml030w nucleic acid (SEQ ID 
NO:92) or its complement under low stringency conditions, and 

(xxxii) Yor179c (SEQ ID NO:93), or a mammalian homolog thereof, or a variant of 
Yor179c encoded by a nucleic acid that hybridizes to the Yor179c nucleic acid (SEQ ID 
NO:94) or its complement under low stringency conditions, 

wherein said first protein and said second protein are members of a native cellular
Polyadenylation-complex, and wherein said low stringency conditions comprise
hybridization in a buffer comprising 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5),
5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 ug/ml denatured salmon sperm
DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at 40°C, washing in a buffer
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5
hours at 55°C, and washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5
mM EDTA, and 0.1% SDS for 1.5 hours at 60°C, and a complex (II) comprising at least
two second proteins.

2. An isolated complex comprising the following proteins: 

(i) Act1 (SEQ ID NO:1), or a mammalian homolog thereof, or a variant of Act1
encoded by a nucleic acid that hybridizes to the Act1 nucleic acid (SEQ ID NO:2) or its
complement under low stringency conditions,

(ii) Cft1 (SEQ ID NO:3), or a mammalian homolog thereof, or a variant of Cft1
encoded by a nucleic acid that hybridizes to the Cft1 nucleic acid (SEQ ID NO:4) or its
complement under low stringency conditions,

(iii) Cft2 (SEQ ID NO:5), or a mammalian homolog thereof, or a variant of Cft2 
encoded by a nucleic acid that hybridizes to the Cft2 nucleic acid (SEQ ID NO:6) or its 
complement under low stringency conditions, 

(iv) Cka1 (SEQ ID NO:7), or a mammalian homolog thereof, or a variant of Cka1
encoded by a nucleic acid that hybridizes to the Cka1 nucleic acid (SEQ ID NO:8) or its
complement under low stringency conditions,

(v) Clp1 (SEQ ID NO:9), or a mammalian homolog thereof, or a variant of Clp1
encoded by a nucleic acid that hybridizes to the Clp1 nucleic acid (SEQ ID NO:10) or its
complement under low stringency conditions,

(vi) Eft2 (SEQ ID NO:11), or a mammalian homolog thereof, or a variant of Eft2 
encoded by a nucleic acid that hybridizes to the Eft2 nucleic acid (SEQ ID NO:12) or its 
complement under low stringency conditions, 

(vii) Eno2 (SEQ ID NO:13), or a mammalian homolog thereof, or a variant of Eno2
encoded by a nucleic acid that hybridizes to the Eno2 nucleic acid (SEQ ID NO:14) or its
complement under low stringency conditions,

(viii) Glc7 (SEQ ID NO:15), or a mammalian homolog thereof, or a variant of Glc7
encoded by a nucleic acid that hybridizes to the Glc7 nucleic acid (SEQ ID NO:16) or its
complement under low stringency conditions,

(ix) Gpm1 (SEQ ID NO:17), or a mammalian homolog thereof, or a variant of
Gpm1 encoded by a nucleic acid that hybridizes to the Gpm1 nucleic acid (SEQ ID
NO:18) or its complement under low stringency conditions,

(x) Fip1 (SEQ ID NO:19), or a mammalian homolog thereof, or a variant of Fip1
encoded by a nucleic acid that hybridizes to the Fip1 nucleic acid (SEQ ID NO:20) or its
complement under low stringency conditions,

(xi) Hhf2 (SEQ ID NO:21), or a mammalian homolog thereof, or a variant of Hhf2 
encoded by a nucleic acid that hybridizes to the Hhf2 nucleic acid (SEQ ID NO:22) or its 
complement under low stringency conditions, 

(xii) Hta1 (SEQ ID NO:23), or a mammalian homolog thereof, or a variant of Hta1
encoded by a nucleic acid that hybridizes to the Hta1 nucleic acid (SEQ ID NO:24) or its
complement under low stringency conditions,

(xiii) Hsc82 (SEQ ID NO:25), or a mammalian homolog thereof, or a variant of
Hsc82 encoded by a nucleic acid that hybridizes to the Hsc82 nucleic acid (SEQ ID 
NO:26) or its complement under low stringency conditions, 

(xiv) Imd2 (SEQ ID NO:27), or a mammalian homolog thereof, or a variant of Imd2 
encoded by a nucleic acid that hybridizes to the Imd2 nucleic acid (SEQ ID NO:28) or its 
complement under low stringency conditions, 

(xv) Imd4 (SEQ ID NO:29), or a mammalian homolog thereof, or a variant of Imd4 
encoded by a nucleic acid that hybridizes to the Imd4 nucleic acid (SEQ ID NO:30) or its
complement under low stringency conditions, 

(xvi) Met6 (SEQ ID NO:31), or a mammalian homolog thereof, or a variant of Met6 
encoded by a nucleic acid that hybridizes to the Met6 nucleic acid (SEQ ID NO:32) or its 
complement under low stringency conditions, 

(xvii) Pab1 (SEQ ID NO:33), or a mammalian homolog thereof, or a variant of
Pab1 encoded by a nucleic acid that hybridizes to the Pab1 nucleic acid (SEQ ID NO:34)
or its complement under low stringency conditions,

(xviii) Pap1 (SEQ ID NO:35), or a mammalian homolog thereof, or a variant of
Pap1 encoded by a nucleic acid that hybridizes to the Pap1 nucleic acid (SEQ ID NO:36)
or its complement under low stringency conditions,

(xix) Pcf11 (SEQ ID NO:37), or a mammalian homolog thereof, or a variant of
Pcf11 encoded by a nucleic acid that hybridizes to the Pcf11 nucleic acid (SEQ ID
NO:38) or its complement under low stringency conditions,

(xx) Pdc1 (SEQ ID NO:39), or a mammalian homolog thereof, or a variant of Pdc1
encoded by a nucleic acid that hybridizes to the Pdc1 nucleic acid (SEQ ID NO:40) or its
complement under low stringency conditions,

(xxi) Pfk1 (SEQ ID NO:41), or a mammalian homolog thereof, or a variant of Pfk1
encoded by a nucleic acid that hybridizes to the Pfk1 nucleic acid (SEQ ID NO:42) or its
complement under low stringency conditions,

(xxii) Pfs2 (SEQ ID NO:43), or a mammalian homolog thereof, or a variant of Pfs2 
encoded by a nucleic acid that hybridizes to the Pfs2 nucleic acid (SEQ ID NO:44) or its 
complement under low stringency conditions, 

(xxiii) Pta1 (SEQ ID NO:45), or a mammalian homolog thereof, or a variant of
Pta1 encoded by a nucleic acid that hybridizes to the Pta1 nucleic acid (SEQ ID NO:46)
or its complement under low stringency conditions,

(xxiv) Ref2 (SEQ ID NO:47), or a mammalian homolog thereof, or a variant of
Ref2 encoded by a nucleic acid that hybridizes to the Ref2 nucleic acid (SEQ ID NO:48) 
or its complement under low stringency conditions, 

(xxv) Rna14 (SEQ ID NO:49), or a mammalian homolog thereof, or a variant of 
Rna14 encoded by a nucleic acid that hybridizes to the Rna14 nucleic acid (SEQ ID 
NO:50) or its complement under low stringency conditions, 

(xxvi) Rna15 (SEQ ID NO:51), or a mammalian homolog thereof, or a variant of 
Rna15 encoded by a nucleic acid that hybridizes to the Rna15 nucleic acid (SEQ ID 
NO:52) or its complement under low stringency conditions, 

(xxvii) Sec13 (SEQ ID NO:53), or a mammalian homolog thereof, or a variant of
Sec13 encoded by a nucleic acid that hybridizes to the Sec13 nucleic acid (SEQ ID
NO:54) or its complement under low stringency conditions,

(xxviii) Sec31 (SEQ ID NO:55), or a mammalian homolog thereof, or a variant of 
Sec31 encoded by a nucleic acid that hybridizes to the Sec31 nucleic acid (SEQ ID 
NO:56) or its complement under low stringency conditions, 

(xxix) Ssa3 (SEQ ID NO:57), or a mammalian homolog thereof, or a variant of 
Ssa3 encoded by a nucleic acid that hybridizes to the Ssa3 nucleic acid (SEQ ID NO:58) 
or its complement under low stringency conditions, 

(xxx) Ssu72 (SEQ ID NO:59), or a mammalian homolog thereof, or a variant of 
Ssu72 encoded by a nucleic acid that hybridizes to the Ssu72 nucleic acid (SEQ ID 
NO:60) or its complement under low stringency conditions, 

(xxxi) Taf60 (SEQ ID NO:61), or a mammalian homolog thereof, or a variant of 
Taf60 encoded by a nucleic acid that hybridizes to the Taf60 nucleic acid (SEQ ID 
NO:62) or its complement under low stringency conditions, 

(xxxii) Tif4632 (SEQ ID NO:63), or a mammalian homolog thereof, or a variant of 
Tif4632 encoded by a nucleic acid that hybridizes to the Tif4632 nucleic acid (SEQ ID 
NO:64) or its complement under low stringency conditions, 

(xxxiii) Tkl1 (SEQ ID NO:65), or a mammalian homolog thereof, or a variant of
Tkl1 encoded by a nucleic acid that hybridizes to the Tkl1 nucleic acid (SEQ ID NO:66)
or its complement under low stringency conditions,

(xxxiv) Tsa1 (SEQ ID NO:67), or a mammalian homolog thereof, or a variant of
Tsa1 encoded by a nucleic acid that hybridizes to the Tsa1 nucleic acid (SEQ ID NO:68)
or its complement under low stringency conditions,

(xxxv) Tye7 (SEQ ID NO:69), or a mammalian homolog thereof, or a variant of
Tye7 encoded by a nucleic acid that hybridizes to the Tye7 nucleic acid (SEQ ID NO:70)
or its complement under low stringency conditions,

(xxxvi) Vid24 (SEQ ID NO:71), or a mammalian homolog thereof, or a variant of 
Vid24 encoded by a nucleic acid that hybridizes to the Vid24 nucleic acid (SEQ ID 
NO:72) or its complement under low stringency conditions, 

(xxxvii) Vps53 (SEQ ID NO:73), or a mammalian homolog thereof, or a variant of 
Vps53 encoded by a nucleic acid that hybridizes to the Vps53 nucleic acid (SEQ ID 
NO:74) or its complement under low stringency conditions, 

(xxxviii) Ysh1 (SEQ ID NO:75), or a mammalian homolog thereof, or a variant of
Ysh1 encoded by a nucleic acid that hybridizes to the Ysh1 nucleic acid (SEQ ID NO:76)
or its complement under low stringency conditions,

(xxxix) Yth1 (SEQ ID NO:77), or a mammalian homolog thereof, or a variant of
Yth1 encoded by a nucleic acid that hybridizes to the Yth1 nucleic acid (SEQ ID NO:78)
or its complement under low stringency conditions,

(xl) Ycl046w (SEQ ID NO:79), or a mammalian homolog thereof, or a variant of
Ycl046w encoded by a nucleic acid that hybridizes to the Ycl046w nucleic acid (SEQ ID
NO:80) or its complement under low stringency conditions,

(xli) Ygr156w (SEQ ID NO:81), or a mammalian homolog thereof, or a variant of 
Ygr156w encoded by a nucleic acid that hybridizes to the Ygr156w nucleic acid (SEQ ID 
NO:82) or its complement under low stringency conditions, 

(xlii) Yhl035c (SEQ ID NO:83), or a mammalian homolog thereof, or a variant of 
Yhl035c encoded by a nucleic acid that hybridizes to the Yhl035c nucleic acid (SEQ ID
NO:84) or its complement under low stringency conditions, 

(xliii) Ykl018w (SEQ ID NO:85), or a mammalian homolog thereof, or a variant of 
Ykl018w encoded by a nucleic acid that hybridizes to the Ykl018w nucleic acid (SEQ ID 
NO:86) or its complement under low stringency conditions, 

(xliv) Ylr221c (SEQ ID NO:87), or a mammalian homolog thereof, or a variant of 
Ylr221c encoded by a nucleic acid that hybridizes to the Ylr221c nucleic acid (SEQ ID 
NO:88) or its complement under low stringency conditions, 

(xlv) Ykl059c (SEQ ID NO:89), or a mammalian homolog thereof, or a variant of 
Ykl059c encoded by a nucleic acid that hybridizes to the Ykl059c nucleic acid (SEQ ID 
NO:90) or its complement under low stringency conditions,

(xlvi) Yml030w (SEQ ID NO:91), or a mammalian homolog thereof, or a variant of
Yml030w encoded by a nucleic acid that hybridizes to the Yml030w nucleic acid (SEQ ID
NO:92) or its complement under low stringency conditions, and

(xlvii) Yor179c (SEQ ID NO:93), or a mammalian homolog thereof, or a variant of
Yor179c encoded by a nucleic acid that hybridizes to the Yor179c nucleic acid (SEQ ID
NO:94) or its complement under low stringency conditions,
wherein said proteins are members of a native cellular Polyadenylation-complex, and
wherein said low stringency conditions comprise hybridization in a buffer comprising
35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02%
Ficoll, 0.2% BSA, 100 ug/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran
sulfate for 18-20 hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5
hours at 60°C.

3. An isolated complex that comprises all but 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 or 28 of the following 47 proteins:

(i) Act1 (SEQ ID NO:1), or a mammalian homolog thereof, or a variant of Act1
encoded by a nucleic acid that hybridizes to the Act1 nucleic acid (SEQ ID NO:2) or its
complement under low stringency conditions,

(ii) Cft1 (SEQ ID NO:3), or a mammalian homolog thereof, or a variant of Cft1
encoded by a nucleic acid that hybridizes to the Cft1 nucleic acid (SEQ ID NO:4) or its
complement under low stringency conditions,

(iii) Cft2 (SEQ ID NO:5), or a mammalian homolog thereof, or a variant of Cft2 
encoded by a nucleic acid that hybridizes to the Cft2 nucleic acid (SEQ ID NO:6) or its 
complement under low stringency conditions, 

(iv) Cka1 (SEQ ID NO:7), or a mammalian homolog thereof, or a variant of Cka1
encoded by a nucleic acid that hybridizes to the Cka1 nucleic acid (SEQ ID NO:8) or its
complement under low stringency conditions,

(v) Clp1 (SEQ ID NO:9), or a mammalian homolog thereof, or a variant of Clp1
encoded by a nucleic acid that hybridizes to the Clp1 nucleic acid (SEQ ID NO:10) or its
complement under low stringency conditions,

(vi) Eft2 (SEQ ID NO:11), or a mammalian homolog thereof, or a variant of Eft2
encoded by a nucleic acid that hybridizes to the Eft2 nucleic acid (SEQ ID NO:12) or its
complement under low stringency conditions,

(vii) Eno2 (SEQ ID NO:13), or a mammalian homolog thereof, or a variant of Eno2
encoded by a nucleic acid that hybridizes to the Eno2 nucleic acid (SEQ ID NO:14) or its
complement under low stringency conditions,

(viii) Glc7 (SEQ ID NO:15), or a mammalian homolog thereof, or a variant of Glc7
encoded by a nucleic acid that hybridizes to the Glc7 nucleic acid (SEQ ID NO:16) or its
complement under low stringency conditions,

(ix) Gpm1 (SEQ ID NO:17), or a mammalian homolog thereof, or a variant of
Gpm1 encoded by a nucleic acid that hybridizes to the Gpm1 nucleic acid (SEQ ID
NO:18) or its complement under low stringency conditions,

(x) Fip1 (SEQ ID NO:19), or a mammalian homolog thereof, or a variant of Fip1
encoded by a nucleic acid that hybridizes to the Fip1 nucleic acid (SEQ ID NO:20) or its
complement under low stringency conditions, 

(xi) Hhf2 (SEQ ID NO:21), or a mammalian homolog thereof, or a variant of Hhf2 
encoded by a nucleic acid that hybridizes to the Hhf2 nucleic acid (SEQ ID NO:22) or its 
complement under low stringency conditions, 

(xii) Hta1 (SEQ ID NO:23), or a mammalian homolog thereof, or a variant of Hta1
encoded by a nucleic acid that hybridizes to the Hta1 nucleic acid (SEQ ID NO:24) or its
complement under low stringency conditions, 

(xiii) Hsc82 (SEQ ID NO:25), or a mammalian homolog thereof, or a variant of 
Hsc82 encoded by a nucleic acid that hybridizes to the Hsc82 nucleic acid (SEQ ID 
NO:26) or its complement under low stringency conditions, 

(xiv) Imd2 (SEQ ID NO:27), or a mammalian homolog thereof, or a variant of Imd2 
encoded by a nucleic acid that hybridizes to the Imd2 nucleic acid (SEQ ID NO:28) or its 
complement under low stringency conditions, 

(xv) Imd4 (SEQ ID NO:29), or a mammalian homolog thereof, or a variant of Imd4 
encoded by a nucleic acid that hybridizes to the Imd4 nucleic acid (SEQ ID NO:30) or its 
complement under low stringency conditions, 

(xvi) Met6 (SEQ ID NO:31), or a mammalian homolog thereof, or a variant of Met6
encoded by a nucleic acid that hybridizes to the Met6 nucleic acid (SEQ ID NO:32) or its
complement under low stringency conditions,

(xvii) Pab1 (SEQ ID NO:33), or a mammalian homolog thereof, or a variant of
Pab1 encoded by a nucleic acid that hybridizes to the Pab1 nucleic acid (SEQ ID NO:34)
or its complement under low stringency conditions,

(xviii) Pap1 (SEQ ID NO:35), or a mammalian homolog thereof, or a variant of
Pap1 encoded by a nucleic acid that hybridizes to the Pap1 nucleic acid (SEQ ID NO:36)
or its complement under low stringency conditions,

(xix) Pcf11 (SEQ ID NO:37), or a mammalian homolog thereof, or a variant of
Pcf11 encoded by a nucleic acid that hybridizes to the Pcf11 nucleic acid (SEQ ID
NO:38) or its complement under low stringency conditions,

(xx) Pdc1 (SEQ ID NO:39), or a mammalian homolog thereof, or a variant of Pdc1
encoded by a nucleic acid that hybridizes to the Pdc1 nucleic acid (SEQ ID NO:40) or its
complement under low stringency conditions,

(xxi) Pfk1 (SEQ ID NO:41), or a mammalian homolog thereof, or a variant of Pfk1
encoded by a nucleic acid that hybridizes to the Pfk1 nucleic acid (SEQ ID NO:42) or its
complement under low stringency conditions, 

(xxii) Pfs2 (SEQ ID NO:43), or a mammalian homolog thereof, or a variant of Pfs2 
encoded by a nucleic acid that hybridizes to the Pfs2 nucleic acid (SEQ ID NO:44) or its 
complement under low stringency conditions, 

(xxiii) Pta1 (SEQ ID NO:45), or a mammalian homolog thereof, or a variant of
Pta1 encoded by a nucleic acid that hybridizes to the Pta1 nucleic acid (SEQ ID NO:46)
or its complement under low stringency conditions, 

(xxiv) Ref2 (SEQ ID NO:47), or a mammalian homolog thereof, or a variant of 
Ref2 encoded by a nucleic acid that hybridizes to the Ref2 nucleic acid (SEQ ID NO:48) 
or its complement under low stringency conditions, 

(xxv) Rna14 (SEQ ID NO:49), or a mammalian homolog thereof, or a variant of 
Rna14 encoded by a nucleic acid that hybridizes to the Rna14 nucleic acid (SEQ ID 
NO:50) or its complement under low stringency conditions, 

(xxvi) Rna15 (SEQ ID NO:51), or a mammalian homolog thereof, or a variant of 
Rna15 encoded by a nucleic acid that hybridizes to the Rna15 nucleic acid (SEQ ID 
NO:52) or its complement under low stringency conditions, 

(xxvii) Sec13 (SEQ ID NO:53), or a mammalian homolog thereof, or a variant of
Sec13 encoded by a nucleic acid that hybridizes to the Sec13 nucleic acid (SEQ ID
NO:54) or its complement under low stringency conditions,

(xxviii) Sec31 (SEQ ID NO:55), or a mammalian homolog thereof, or a variant of
Sec31 encoded by a nucleic acid that hybridizes to the Sec31 nucleic acid (SEQ ID 
NO:56) or its complement under low stringency conditions, 

(xxix) Ssa3 (SEQ ID NO:57), or a mammalian homolog thereof, or a variant of 
Ssa3 encoded by a nucleic acid that hybridizes to the Ssa3 nucleic acid (SEQ ID NO:58) 
or its complement under low stringency conditions, 

(xxx) Ssu72 (SEQ ID NO:59), or a mammalian homolog thereof, or a variant of 
Ssu72 encoded by a nucleic acid that hybridizes to the Ssu72 nucleic acid (SEQ ID 
NO:60) or its complement under low stringency conditions, 

(xxxi) Taf60 (SEQ ID NO:61), or a mammalian homolog thereof, or a variant of 
Taf60 encoded by a nucleic acid that hybridizes to the Taf60 nucleic acid (SEQ ID
NO:62) or its complement under low stringency conditions, 

(xxxii) Tif4632 (SEQ ID NO:63), or a mammalian homolog thereof, or a variant of 
Tif4632 encoded by a nucleic acid that hybridizes to the Tif4632 nucleic acid (SEQ ID 
NO:64) or its complement under low stringency conditions, 

(xxxiii) Tkl1 (SEQ ID NO:65), or a mammalian homolog thereof, or a variant of
Tkl1 encoded by a nucleic acid that hybridizes to the Tkl1 nucleic acid (SEQ ID NO:66)
or its complement under low stringency conditions,

(xxxiv) Tsa1 (SEQ ID NO:67), or a mammalian homolog thereof, or a variant of
Tsa1 encoded by a nucleic acid that hybridizes to the Tsa1 nucleic acid (SEQ ID NO:68)
or its complement under low stringency conditions, 

(xxxv) Tye7 (SEQ ID NO:69), or a mammalian homolog thereof, or a variant of 
Tye7 encoded by a nucleic acid that hybridizes to the Tye7 nucleic acid (SEQ ID NO:70) 
or its complement under low stringency conditions, 

(xxxvi) Vid24 (SEQ ID NO:71), or a mammalian homolog thereof, or a variant of 
Vid24 encoded by a nucleic acid that hybridizes to the Vid24 nucleic acid (SEQ ID 
NO:72) or its complement under low stringency conditions, 

(xxxvii) Vps53 (SEQ ID NO:73), or a mammalian homolog thereof, or a variant of 
Vps53 encoded by a nucleic acid that hybridizes to the Vps53 nucleic acid (SEQ ID 
NO:74) or its complement under low stringency conditions, 

(xxxviii) Ysh1 (SEQ ID NO:75), or a mammalian homolog thereof, or a variant of
Ysh1 encoded by a nucleic acid that hybridizes to the Ysh1 nucleic acid (SEQ ID NO:76)
or its complement under low stringency conditions,

(xxxix) Yth1 (SEQ ID NO:77), or a mammalian homolog thereof, or a variant of
Yth1 encoded by a nucleic acid that hybridizes to the Yth1 nucleic acid (SEQ ID NO:78)
or its complement under low stringency conditions,

(xl) Ycl046w (SEQ ID NO:79), or a mammalian homolog thereof, or a variant of
Ycl046w encoded by a nucleic acid that hybridizes to the Ycl046w nucleic acid (SEQ ID
NO:80) or its complement under low stringency conditions,

(xli) Ygr156w (SEQ ID NO:81), or a mammalian homolog thereof, or a variant of
Ygr156w encoded by a nucleic acid that hybridizes to the Ygr156w nucleic acid (SEQ ID 
NO:82) or its complement under low stringency conditions, 

(xlii) Yhl035c (SEQ ID NO:83), or a mammalian homolog thereof, or a variant of 
Yhl035c encoded by a nucleic acid that hybridizes to the Yhl035c nucleic acid (SEQ ID 
NO:84) or its complement under low stringency conditions, 

(xliii) Ykl018w (SEQ ID NO:85), or a mammalian homolog thereof, or a variant of
Ykl018w encoded by a nucleic acid that hybridizes to the Ykl018w nucleic acid (SEQ ID
NO:86) or its complement under low stringency conditions,

(xliv) Ylr221c (SEQ ID NO:87), or a mammalian homolog thereof, or a variant of
Ylr221c encoded by a nucleic acid that hybridizes to the Ylr221c nucleic acid (SEQ ID 
NO:88) or its complement under low stringency conditions, 

(xlv) Ykl059c (SEQ ID NO:89), or a mammalian homolog thereof, or a variant of 
Ykl059c encoded by a nucleic acid that hybridizes to the Ykl059c nucleic acid (SEQ ID 
NO:90) or its complement under low stringency conditions,

(xlvi) Yml030w (SEQ ID NO:91), or a mammalian homolog thereof, or a variant of 
Yml030w encoded by a nucleic acid that hybridizes to the Yml030w nucleic acid (SEQ ID 
NO:92) or its complement under low stringency conditions, and 

(xlvii) Yor179c (SEQ ID NO:93), or a mammalian homolog thereof, or a variant of
Yor179c encoded by a nucleic acid that hybridizes to the Yor179c nucleic acid (SEQ ID 
NO:94) or its complement under low stringency conditions, 
wherein said proteins are members of a native cellular Polyadenylation-complex, and 
wherein said low stringency conditions comprise hybridization in a buffer comprising 
35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02%
Ficoll, 0.2% BSA, 100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran
sulfate for 18-20 hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5
hours at 60°C.

4. The complex according to claim 2, which comprises all but 1 of the 47 proteins. 

5. The complex of claim 1, 2, 3 or 4 comprising a functionally active derivative of said first
protein and/or a functionally active derivative of said second protein, wherein the 
functionally active derivative is a fusion protein comprising said first protein or said 
second protein fused to an amino acid sequence different from the first protein or second 
protein, respectively. 

6. The complex of claim 1, 2, 3 or 4 comprising a fragment of said first protein and/or a
fragment of said second protein, which fragment binds to another protein component of 
said complex. 

7. The complex of claim 1, 2, 3, 4, 5 or 6 that is involved in the 3' end processing activity
for mRNA. 

8. The complex of claim 5 wherein the functionally active derivative is a fusion protein 
comprising said first protein or said second protein fused to an affinity tag or label. 

9. An antibody or a fragment of said antibody containing the binding domain thereof, 
which binds the complex of claim 1, 2, 3, 4, 5, 6 or 7 and which does not bind the first 
protein when uncomplexed or the second protein when uncomplexed. 

10. A process for processing RNA comprising the step of bringing into contact a product
according to any of claims 1-8 with RNA, such that the RNA is processed.

11. A pharmaceutical composition comprising the protein complex of claim 1, 2, 3, 4, 5,
6, 7 or 8; and a pharmaceutically acceptable carrier.

12. A method for screening for a molecule that modulates directly or indirectly the 
function, activity, composition or formation of the complex of any one of claims 1-8 
comprising the steps of:

(a) exposing said complex, or a cell or organism containing said
Polyadenylation-complex, to one or more candidate molecules; and

(b) determining the amount of, 3' end processing activity for mRNA of, or protein
components of, said complex, wherein a change in said amount, activity, or protein 
components relative to said amount, activity or protein components in the absence of 
said candidate molecules indicates that the molecules modulate function, activity or 
composition of said complex. 

13. The method of claim 12, wherein the amount of said complex is determined. 

14. The method of claim 12, wherein the activity of said complex is determined. 

15. The method of claim 14, wherein said determining step comprises isolating from the 
cell or organism said Polyadenylation-complex to produce said isolated complex and 
contacting said isolated complex with an RNA molecule such that the complex binds to
the RNA. 

16. The method of claim 12, wherein the protein components of said complex are 
determined. 

17. The method of claim 16, wherein said determining step comprises determining 
whether 

(i) Act1 (SEQ ID NO:1), or a mammalian homolog thereof, or a variant of Act1
encoded by a nucleic acid that hybridizes to the Act1 nucleic acid (SEQ ID NO:2) or its
complement under low stringency conditions,

(ii) Cka1 (SEQ ID NO:7), or a mammalian homolog thereof, or a variant of Cka1
encoded by a nucleic acid that hybridizes to the Cka1 nucleic acid (SEQ ID NO:8) or its
complement under low stringency conditions,

(iii) Eft2 (SEQ ID NO:11), or a mammalian homolog thereof, or a variant of Eft2
encoded by a nucleic acid that hybridizes to the Eft2 nucleic acid (SEQ ID NO:12) or its
complement under low stringency conditions,

(iv) Eno2 (SEQ ID NO:13), or a mammalian homolog thereof, or a variant of Eno2
encoded by a nucleic acid that hybridizes to the Eno2 nucleic acid (SEQ ID NO:14) or its
complement under low stringency conditions,

(v) Glc7 (SEQ ID NO:15), or a mammalian homolog thereof, or a variant of Glc7
encoded by a nucleic acid that hybridizes to the Glc7 nucleic acid (SEQ ID NO:16) or its
complement under low stringency conditions,

(vi) Gpm1 (SEQ ID NO:17), or a mammalian homolog thereof, or a variant of
Gpm1 encoded by a nucleic acid that hybridizes to the Gpm1 nucleic acid (SEQ ID
NO:18) or its complement under low stringency conditions,

(vii) Hhf2 (SEQ ID NO:21), or a mammalian homolog thereof, or a variant of Hhf2
encoded by a nucleic acid that hybridizes to the Hhf2 nucleic acid (SEQ ID NO:22) or its
complement under low stringency conditions,

(viii) Hta1 (SEQ ID NO:23), or a mammalian homolog thereof, or a variant of Hta1
encoded by a nucleic acid that hybridizes to the Hta1 nucleic acid (SEQ ID NO:24) or its
complement under low stringency conditions, 

(ix) Hsc82 (SEQ ID NO:25), or a mammalian homolog thereof, or a variant of 
Hsc82 encoded by a nucleic acid that hybridizes to the Hsc82 nucleic acid (SEQ ID 
NO:26) or its complement under low stringency conditions, 

(x) Imd2 (SEQ ID NO:27), or a mammalian homolog thereof, or a variant of Imd2 
encoded by a nucleic acid that hybridizes to the Imd2 nucleic acid (SEQ ID NO:28) or its 
complement under low stringency conditions, 

(xi) Imd4 (SEQ ID NO:29), or a mammalian homolog thereof, or a variant of Imd4 
encoded by a nucleic acid that hybridizes to the Imd4 nucleic acid (SEQ ID NO:30) or its 
complement under low stringency conditions, 

(xii) Met6 (SEQ ID NO:31), or a mammalian homolog thereof, or a variant of Met6 
encoded by a nucleic acid that hybridizes to the Met6 nucleic acid (SEQ ID NO:32) or its 
complement under low stringency conditions, 

(xiii) Pdc1 (SEQ ID NO:39), or a mammalian homolog thereof, or a variant of
Pdc1 encoded by a nucleic acid that hybridizes to the Pdc1 nucleic acid (SEQ ID NO:40)
or its complement under low stringency conditions,

(xiv) Pfk1 (SEQ ID NO:41), or a mammalian homolog thereof, or a variant of Pfk1
encoded by a nucleic acid that hybridizes to the Pfk1 nucleic acid (SEQ ID NO:42) or its
complement under low stringency conditions,

(xv) Ref2 (SEQ ID NO:47), or a mammalian homolog thereof, or a variant of Ref2
encoded by a nucleic acid that hybridizes to the Ref2 nucleic acid (SEQ ID NO:48) or its
complement under low stringency conditions,

(xvi) Sec13 (SEQ ID NO:53), or a mammalian homolog thereof, or a variant of
Sec13 encoded by a nucleic acid that hybridizes to the Sec13 nucleic acid (SEQ ID
NO:54) or its complement under low stringency conditions, 

(xvii) Sec31 (SEQ ID NO:55), or a mammalian homolog thereof, or a variant of 
Sec31 encoded by a nucleic acid that hybridizes to the Sec31 nucleic acid (SEQ ID 
NO:56) or its complement under low stringency conditions, 

(xviii) Ssa3 (SEQ ID NO:57), or a mammalian homolog thereof, or a variant of 
Ssa3 encoded by a nucleic acid that hybridizes to the Ssa3 nucleic acid (SEQ ID NO:58)
or its complement under low stringency conditions,

(xix) Ssu72 (SEQ ID NO:59), or a mammalian homolog thereof, or a variant of
Ssu72 encoded by a nucleic acid that hybridizes to the Ssu72 nucleic acid (SEQ ID
NO:60) or its complement under low stringency conditions,

(xx) Taf60 (SEQ ID NO:61), or a mammalian homolog thereof, or a variant of
Taf60 encoded by a nucleic acid that hybridizes to the Taf60 nucleic acid (SEQ ID
NO:62) or its complement under low stringency conditions,

(xxi) Tkl1 (SEQ ID NO:65), or a mammalian homolog thereof, or a variant of Tkl1
encoded by a nucleic acid that hybridizes to the Tkl1 nucleic acid (SEQ ID NO:66) or its
complement under low stringency conditions,

(xxii) Tsa1 (SEQ ID NO:67), or a mammalian homolog thereof, or a variant of
Tsa1 encoded by a nucleic acid that hybridizes to the Tsa1 nucleic acid (SEQ ID NO:68)
or its complement under low stringency conditions, 

(xxiii) Tye7 (SEQ ID NO:69), or a mammalian homolog thereof, or a variant of 
Tye7 encoded by a nucleic acid that hybridizes to the Tye7 nucleic acid (SEQ ID NO:70) 
or its complement under low stringency conditions, 

(xxiv) Vid24 (SEQ ID NO:71), or a mammalian homolog thereof, or a variant of 
Vid24 encoded by a nucleic acid that hybridizes to the Vid24 nucleic acid (SEQ ID 
NO:72) or its complement under low stringency conditions, 

(xxv) Vps53 (SEQ ID NO:73), or a mammalian homolog thereof, or a variant of 
Vps53 encoded by a nucleic acid that hybridizes to the Vps53 nucleic acid (SEQ ID 
NO:74) or its complement under low stringency conditions, 

(xxvi) Ycl046w (SEQ ID NO:79), or a mammalian homolog thereof, or a variant of 
Ycl046w encoded by a nucleic acid that hybridizes to the Ycl046w nucleic acid (SEQ ID
NO:80) or its complement under low stringency conditions,

(xxvii) Ygr156w (SEQ ID NO:81), or a mammalian homolog thereof, or a variant of
Ygr156w encoded by a nucleic acid that hybridizes to the Ygr156w nucleic acid (SEQ ID 
NO:82) or its complement under low stringency conditions, 

(xxviii) Yhl035c (SEQ ID NO:83), or a mammalian homolog thereof, or a variant of 
Yhl035c encoded by a nucleic acid that hybridizes to the Yhl035c nucleic acid (SEQ ID 
NO:84) or its complement under low stringency conditions, 

(xxix) Ykl018w (SEQ ID NO:85), or a mammalian homolog thereof, or a variant of
Ykl018w encoded by a nucleic acid that hybridizes to the Ykl018w nucleic acid (SEQ ID 
NO:86) or its complement under low stringency conditions, 

(xxx) Ylr221c (SEQ ID NO:87), or a mammalian homolog thereof, or a variant of 
Ylr221c encoded by a nucleic acid that hybridizes to the Ylr221c nucleic acid (SEQ ID 
NO:88) or its complement under low stringency conditions, 

(xxxi) Yml030w (SEQ ID NO:91), or a mammalian homolog thereof, or a variant of
Yml030w encoded by a nucleic acid that hybridizes to the Yml030w nucleic acid (SEQ ID 
NO:92) or its complement under low stringency conditions, and 

(xxxii) Yor179c (SEQ ID NO:93), or a mammalian homolog thereof, or a variant of
Yor179c encoded by a nucleic acid that hybridizes to the Yor179c nucleic acid (SEQ ID 
NO:94) or its complement under low stringency conditions, is present in the complex, 
wherein said low stringency conditions comprise hybridization in a buffer comprising 
35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02%
Ficoll, 0.2% BSA, 100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran
sulfate for 18-20 hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5
hours at 60°C.

18. The method of any of claims 12 to 17, wherein said method is a method of screening
for a drug for treatment or prevention of a disease or disorder such as infectious
diseases; viral infections such as herpes simplex infections, Epstein-Barr infections,
influenza; metabolic disease such as metachromatic leukodystrophy; neurodegenerative 
disorders such as amyotrophic lateral sclerosis; cancer. 

19. A method for screening for a molecule that binds the complex of any one of claims 1-8
comprising the following steps:

(a) exposing said complex, or a cell or organism containing said
Polyadenylation-complex, to one or more candidate molecules; and

(b) determining whether said complex is bound by any of said candidate 
molecules. 

20. A method for diagnosing or screening for the presence of a disease or disorder or a 
predisposition for developing a disease or disorder in a subject, which disease or 
disorder is characterized by an aberrant amount of 3' end processing activity for mRNA 
of, or component composition of, the complex of any one of claims 1-8, comprising
determining the amount of, 3’ end processing activity for mRNA of, or protein 
components of, said complex in a sample derived from a subject, wherein a difference in 
said amount, activity, or protein components of, said complex in an analogous sample 
from a subject not having the disease or disorder or predisposition indicates the 
presence in the subject of the disease or disorder or predisposition. 

21. The method of claim 20, wherein the amount of said complex is determined. 

22. The method of claim 20, wherein the activity of said complex is determined. 

23. The method of claim 22, wherein said determining step comprises isolating from the 
subject said Polyadenylation-complex to produce said isolated complex and contacting 
said isolated complex with an RNA molecule such that the complex binds to the RNA.

24. The method of claim 20, wherein the protein components of said complex are 
determined. 

25. The method of claim 24, wherein said determining step comprises determining 
whether 

(i) Act1 (SEQ ID NO:1), or a mammalian homolog thereof, or a variant of Act1
encoded by a nucleic acid that hybridizes to the Act1 nucleic acid (SEQ ID NO:2) or its
complement under low stringency conditions,

(ii) Cka1 (SEQ ID NO:7), or a mammalian homolog thereof, or a variant of Cka1
encoded by a nucleic acid that hybridizes to the Cka1 nucleic acid (SEQ ID NO:8) or its
complement under low stringency conditions,

(iii) Eft2 (SEQ ID NO:11), or a mammalian homolog thereof, or a variant of Eft2
encoded by a nucleic acid that hybridizes to the Eft2 nucleic acid (SEQ ID NO:12) or its
complement under low stringency conditions,

(iv) Eno2 (SEQ ID NO:13), or a mammalian homolog thereof, or a variant of Eno2
encoded by a nucleic acid that hybridizes to the Eno2 nucleic acid (SEQ ID NO:14) or its
complement under low stringency conditions,

(v) Glc7 (SEQ ID NO:15), or a mammalian homolog thereof, or a variant of Glc7
encoded by a nucleic acid that hybridizes to the Glc7 nucleic acid (SEQ ID NO:16) or its
complement under low stringency conditions,

(vi) Gpm1 (SEQ ID NO:17), or a mammalian homolog thereof, or a variant of
Gpm1 encoded by a nucleic acid that hybridizes to the Gpm1 nucleic acid (SEQ ID
NO:18) or its complement under low stringency conditions,

(vii) Hhf2 (SEQ ID NO:21), or a mammalian homolog thereof, or a variant of Hhf2
encoded by a nucleic acid that hybridizes to the Hhf2 nucleic acid (SEQ ID NO:22) or its
complement under low stringency conditions,

(viii) Hta1 (SEQ ID NO:23), or a mammalian homolog thereof, or a variant of Hta1
encoded by a nucleic acid that hybridizes to the Hta1 nucleic acid (SEQ ID NO:24) or its
complement under low stringency conditions, 

(ix) Hsc82 (SEQ ID NO:25), or a mammalian homolog thereof, or a variant of 
Hsc82 encoded by a nucleic acid that hybridizes to the Hsc82 nucleic acid (SEQ ID 
NO:26) or its complement under low stringency conditions, 

(x) Imd2 (SEQ ID NO:27), or a mammalian homolog thereof, or a variant of Imd2 
encoded by a nucleic acid that hybridizes to the Imd2 nucleic acid (SEQ ID NO:28) or its 
complement under low stringency conditions, 

(xi) Imd4 (SEQ ID NO:29), or a mammalian homolog thereof, or a variant of Imd4 
encoded by a nucleic acid that hybridizes to the Imd4 nucleic acid (SEQ ID NO:30) or its
complement under low stringency conditions, 

(xii) Met6 (SEQ ID NO:31), or a mammalian homolog thereof, or a variant of Met6 
encoded by a nucleic acid that hybridizes to the Met6 nucleic acid (SEQ ID NO:32) or its 
complement under low stringency conditions, 

(xiii) Pdc1 (SEQ ID NO:39), or a mammalian homolog thereof, or a variant of
Pdc1 encoded by a nucleic acid that hybridizes to the Pdc1 nucleic acid (SEQ ID NO:40)
or its complement under low stringency conditions,

(xiv) Pfk1 (SEQ ID NO:41), or a mammalian homolog thereof, or a variant of Pfk1
encoded by a nucleic acid that hybridizes to the Pfk1 nucleic acid (SEQ ID NO:42) or its
complement under low stringency conditions,

(xv) Ref2 (SEQ ID NO:47), or a mammalian homolog thereof, or a variant of Ref2
encoded by a nucleic acid that hybridizes to the Ref2 nucleic acid (SEQ ID NO:48) or its
complement under low stringency conditions,

(xvi) Sec13 (SEQ ID NO:53), or a mammalian homolog thereof, or a variant of
Sec13 encoded by a nucleic acid that hybridizes to the Sec13 nucleic acid (SEQ ID
NO:54) or its complement under low stringency conditions, 

(xvii) Sec31 (SEQ ID NO:55), or a mammalian homolog thereof, or a variant of 
Sec31 encoded by a nucleic acid that hybridizes to the Sec31 nucleic acid (SEQ ID 
NO:56) or its complement under low stringency conditions, 

(xviii) Ssa3 (SEQ ID NO:57), or a mammalian homolog thereof, or a variant of 
Ssa3 encoded by a nucleic acid that hybridizes to the Ssa3 nucleic acid (SEQ ID NO:58) 
or its complement under low stringency conditions, 

(xix) Ssu72 (SEQ ID NO:59), or a mammalian homolog thereof, or a variant of 
Ssu72 encoded by a nucleic acid that hybridizes to the Ssu72 nucleic acid (SEQ ID 
NO:60) or its complement under low stringency conditions, 

(xx) Taf60 (SEQ ID NO:61), or a mammalian homolog thereof, or a variant of 
Taf60 encoded by a nucleic acid that hybridizes to the Taf60 nucleic acid (SEQ ID 
NO:62) or its complement under low stringency conditions, 

(xxi) Tkl1 (SEQ ID NO:65), or a mammalian homolog thereof, or a variant of Tkl1
encoded by a nucleic acid that hybridizes to the Tkl1 nucleic acid (SEQ ID NO:66) or its
complement under low stringency conditions,

(xxii) Tsa1 (SEQ ID NO:67), or a mammalian homolog thereof, or a variant of
Tsa1 encoded by a nucleic acid that hybridizes to the Tsa1 nucleic acid (SEQ ID NO:68)
or its complement under low stringency conditions, 

(xxiii) Tye7 (SEQ ID NO:69), or a mammalian homolog thereof, or a variant of 
Tye7 encoded by a nucleic acid that hybridizes to the Tye7 nucleic acid (SEQ ID NO:70) 
or its complement under low stringency conditions, 

(xxiv) Vid24 (SEQ ID NO:71), or a mammalian homolog thereof, or a variant of 
Vid24 encoded by a nucleic acid that hybridizes to the Vid24 nucleic acid (SEQ ID 
NO:72) or its complement under low stringency conditions,

(xxv) Vps53 (SEQ ID NO:73), or a mammalian homolog thereof, or a variant of
Vps53 encoded by a nucleic acid that hybridizes to the Vps53 nucleic acid (SEQ ID 
NO:74) or its complement under low stringency conditions, 

(xxvi) Ycl046w (SEQ ID NO:79), or a mammalian homolog thereof, or a variant of 
Ycl046w encoded by a nucleic acid that hybridizes to the Ycl046w nucleic acid (SEQ ID 
NO:80) or its complement under low stringency conditions, 

(xxvii) Ygr156w (SEQ ID NO:81), or a mammalian homolog thereof, or a variant of 
Ygr156w encoded by a nucleic acid that hybridizes to the Ygr156w nucleic acid (SEQ ID 
NO:82) or its complement under low stringency conditions, 

(xxviii) Yhl035c (SEQ ID NO:83), or a mammalian homolog thereof, or a variant of 
Yhl035c encoded by a nucleic acid that hybridizes to the Yhl035c nucleic acid (SEQ ID 
NO:84) or its complement under low stringency conditions, 

(xxix) Ykl018w (SEQ ID NO:85), or a mammalian homolog thereof, or a variant of 
Ykl018w encoded by a nucleic acid that hybridizes to the Ykl018w nucleic acid (SEQ ID 
NO:86) or its complement under low stringency conditions, 

(xxx) Ylr221c (SEQ ID NO:87), or a mammalian homolog thereof, or a variant of
Ylr221c encoded by a nucleic acid that hybridizes to the Ylr221c nucleic acid (SEQ ID 
NO:88) or its complement under low stringency conditions, 

(xxxi) Yml030w (SEQ ID NO:91), or a mammalian homolog thereof, or a variant of 
Yml030w encoded by a nucleic acid that hybridizes to the Yml030w nucleic acid (SEQ ID 
NO:92) or its complement under low stringency conditions, and 

(xxxii) Yor179c (SEQ ID NO:93), or a mammalian homolog thereof, or a variant of 
Yor179c encoded by a nucleic acid that hybridizes to the Yor179c nucleic acid (SEQ ID 
NO:94) or its complement under low stringency conditions, is present in the complex, 
wherein said low stringency conditions comprise hybridization in a buffer comprising 
35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02%
Ficoll, 0.2% BSA, 100 µg/ml denatured salmon sperm DNA, and 10% (wt/vol) dextran
sulfate for 18-20 hours at 40°C, washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 55°C, and washing in a buffer
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5
hours at 60°C.

26. A method for treating or preventing a disease or disorder characterized by an 
aberrant amount of, 3' end processing activity for mRNA of, or component composition
of, the complex of any one of claims 1-8, comprising administering to a subject in need of
such treatment or prevention a therapeutically effective amount of one or more 
molecules that modulate the amount of, 3’ end processing activity for mRNA of, or 
protein components of, said complex. 

27. The method according to claim 26, wherein said disease or disorder involves 
decreased levels of the amount or activity of said complex. 

28. The method according to claim 26, wherein said disease or disorder involves 
increased levels of the amount or activity of said complex. 

29. Use of a molecule that modulates the amount of, 3’ end processing activity for mRNA
of, or the protein components of the complex of any one of claims 1-8 for the manufacture
of a medicament for the treatment or prevention of a disease or disorder such as
infectious diseases; viral infections such as herpes simplex infections, Epstein-Barr
infections, influenza; metabolic disease such as metachromatic leukodystrophy;
neurodegenerative disorders such as amyotrophic lateral sclerosis; cancer.

30. A kit comprising in one or more containers 

(a) an isolated first protein, or a functionally active fragment or functionally active 
derivative thereof, which first protein is selected from the group consisting of: 

(i) Cft1 (SEQ ID NO:3), or a mammalian homolog thereof, or a variant of Cft1
encoded by a nucleic acid that hybridizes to the Cft1 nucleic acid (SEQ ID NO:4) or its
complement under low stringency conditions,

(ii) Cft2 (SEQ ID NO:5), or a mammalian homolog thereof, or a variant of Cft2
encoded by a nucleic acid that hybridizes to the Cft2 nucleic acid (SEQ ID NO:6) or its
complement under low stringency conditions,

(iii) Clp1 (SEQ ID NO:9), or a mammalian homolog thereof, or a variant of Clp1
encoded by a nucleic acid that hybridizes to the Clp1 nucleic acid (SEQ ID NO:10) or its
complement under low stringency conditions,

(iv) Fip1 (SEQ ID NO:19), or a mammalian homolog thereof, or a variant of Fip1
encoded by a nucleic acid that hybridizes to the Fip1 nucleic acid (SEQ ID NO:20) or its
complement under low stringency conditions,

(v) Pab1 (SEQ ID NO:33), or a mammalian homolog thereof, or a variant of Pab1
encoded by a nucleic acid that hybridizes to the Pab1 nucleic acid (SEQ ID NO:34) or its
complement under low stringency conditions,

(vi) Pap1 (SEQ ID NO:35), or a mammalian homolog thereof, or a variant of Pap1
encoded by a nucleic acid that hybridizes to the Pap1 nucleic acid (SEQ ID NO:36) or its
complement under low stringency conditions,

(vii) Pcf11 (SEQ ID NO:37), or a mammalian homolog thereof, or a variant of
Pcf11 encoded by a nucleic acid that hybridizes to the Pcf11 nucleic acid (SEQ ID
NO:38) or its complement under low stringency conditions,

(viii) Pfs2 (SEQ ID NO:43), or a mammalian homolog thereof, or a variant of Pfs2 
encoded by a nucleic acid that hybridizes to the Pfs2 nucleic acid (SEQ ID NO:44) or its 
complement under low stringency conditions, 

(ix) Pta1 (SEQ ID NO:45), or a mammalian homolog thereof, or a variant of Pta1
encoded by a nucleic acid that hybridizes to the Pta1 nucleic acid (SEQ ID NO:46) or its
complement under low stringency conditions,

(x) Rna14 (SEQ ID NO:49), or a mammalian homolog thereof, or a variant of
Rna14 encoded by a nucleic acid that hybridizes to the Rna14 nucleic acid (SEQ ID
NO:50) or its complement under low stringency conditions,

(xi) Rna15 (SEQ ID NO:51), or a mammalian homolog thereof, or a variant of 
Rna15 encoded by a nucleic acid that hybridizes to the Rna15 nucleic acid (SEQ ID 
NO:52) or its complement under low stringency conditions, 

(xii) Tif4632 (SEQ ID NO:63), or a mammalian homolog thereof, or a variant of 
Tif4632 encoded by a nucleic acid that hybridizes to the Tif4632 nucleic acid (SEQ ID 
NO:64) or its complement under low stringency conditions, 

(xiii) Ykl059c (SEQ ID NO:89), or a mammalian homolog thereof, or a variant of 
Ykl059c encoded by a nucleic acid that hybridizes to the Ykl059c nucleic acid (SEQ ID 
NO:90) or its complement under low stringency conditions, 

(xiv) Ysh1 (SEQ ID NO:75), or a mammalian homolog thereof, or a variant of
Ysh1 encoded by a nucleic acid that hybridizes to the Ysh1 nucleic acid (SEQ ID NO:76)
or its complement under low stringency conditions, and

(xv) Yth1 (SEQ ID NO:77), or a mammalian homolog thereof, or a variant of Yth1
encoded by a nucleic acid that hybridizes to the Yth1 nucleic acid (SEQ ID NO:78) or its
complement under low stringency conditions; and

(b) a second protein, or a functionally active fragment or functionally active
derivative thereof, which second protein is selected from the group consisting of:

(i) Act1 (SEQ ID NO:1), or a mammalian homolog thereof, or a variant of Act1
encoded by a nucleic acid that hybridizes to the Act1 nucleic acid (SEQ ID NO:2) or its
complement under low stringency conditions,

(ii) Cka1 (SEQ ID NO:7), or a mammalian homolog thereof, or a variant of Cka1
encoded by a nucleic acid that hybridizes to the Cka1 nucleic acid (SEQ ID NO:8) or its
complement under low stringency conditions,

(iii) Eft2 (SEQ ID NO:11), or a mammalian homolog thereof, or a variant of Eft2 
encoded by a nucleic acid that hybridizes to the Eft2 nucleic acid (SEQ ID NO:12) or its 
complement under low stringency conditions, 

(iv) Eno2 (SEQ ID NO:13), or a mammalian homolog thereof, or a variant of Eno2
encoded by a nucleic acid that hybridizes to the Eno2 nucleic acid (SEQ ID NO:14) or its
complement under low stringency conditions,

(v) Glc7 (SEQ ID NO:15), or a mammalian homolog thereof, or a variant of Glc7
encoded by a nucleic acid that hybridizes to the Glc7 nucleic acid (SEQ ID NO:16) or its
complement under low stringency conditions,

(vi) Gpm1 (SEQ ID NO:17), or a mammalian homolog thereof, or a variant of
Gpm1 encoded by a nucleic acid that hybridizes to the Gpm1 nucleic acid (SEQ ID
NO:18) or its complement under low stringency conditions,

(vii) Hhf2 (SEQ ID NO:21), or a mammalian homolog thereof, or a variant of Hhf2 
encoded by a nucleic acid that hybridizes to the Hhf2 nucleic acid (SEQ ID NO:22) or its 
complement under low stringency conditions, 

(viii) Hta1 (SEQ ID NO:23), or a mammalian homolog thereof, or a variant of Hta1
encoded by a nucleic acid that hybridizes to the Hta1 nucleic acid (SEQ ID NO:24) or its
complement under low stringency conditions,

(ix) Hsc82 (SEQ ID NO:25), or a mammalian homolog thereof, or a variant of
Hsc82 encoded by a nucleic acid that hybridizes to the Hsc82 nucleic acid (SEQ ID
NO:26) or its complement under low stringency conditions,

(x) Imd2 (SEQ ID NO:27), or a mammalian homolog thereof, or a variant of Imd2 
encoded by a nucleic acid that hybridizes to the Imd2 nucleic acid (SEQ ID NO:28) or its 
complement under low stringency conditions, 

(xi) Imd4 (SEQ ID NO:29), or a mammalian homolog thereof, or a variant of Imd4
encoded by a nucleic acid that hybridizes to the Imd4 nucleic acid (SEQ ID NO:30) or its
complement under low stringency conditions,

(xii) Met6 (SEQ ID NO:31), or a mammalian homolog thereof, or a variant of Met6 
encoded by a nucleic acid that hybridizes to the Met6 nucleic acid (SEQ ID NO:32) or its 
complement under low stringency conditions, 

(xiii) Pdc1 (SEQ ID NO:39), or a mammalian homolog thereof, or a variant of
Pdc1 encoded by a nucleic acid that hybridizes to the Pdc1 nucleic acid (SEQ ID NO:40)
or its complement under low stringency conditions,

(xiv) Pfk1 (SEQ ID NO:41), or a mammalian homolog thereof, or a variant of Pfk1
encoded by a nucleic acid that hybridizes to the Pfk1 nucleic acid (SEQ ID NO:42) or its
complement under low stringency conditions,

(xv) Ref2 (SEQ ID NO:47), or a mammalian homolog thereof, or a variant of Ref2 
encoded by a nucleic acid that hybridizes to the Ref2 nucleic acid (SEQ ID NO:48) or its 
complement under low stringency conditions, 

(xvi) Sec13 (SEQ ID NO:53), or a mammalian homolog thereof, or a variant of
Sec13 encoded by a nucleic acid that hybridizes to the Sec13 nucleic acid (SEQ ID
NO:54) or its complement under low stringency conditions,

(xvii) Sec31 (SEQ ID NO:55), or a mammalian homolog thereof, or a variant of 
Sec31 encoded by a nucleic acid that hybridizes to the Sec31 nucleic acid (SEQ ID 
NO:56) or its complement under low stringency conditions, 

(xviii) Ssa3 (SEQ ID NO:57), or a mammalian homolog thereof, or a variant of
Ssa3 encoded by a nucleic acid that hybridizes to the Ssa3 nucleic acid (SEQ ID NO:58) 
or its complement under low stringency conditions, 

(xix) Ssu72 (SEQ ID NO:59), or a mammalian homolog thereof, or a variant of 
Ssu72 encoded by a nucleic acid that hybridizes to the Ssu72 nucleic acid (SEQ ID 
NO:60) or its complement under low stringency conditions, 

(xx) Taf60 (SEQ ID NO:61), or a mammalian homolog thereof, or a variant of 
Taf60 encoded by a nucleic acid that hybridizes to the Taf60 nucleic acid (SEQ ID 
NO:62) or its complement under low stringency conditions, 

(xxi) Tkl1 (SEQ ID NO:65), or a mammalian homolog thereof, or a variant of Tkl1
encoded by a nucleic acid that hybridizes to the Tkl1 nucleic acid (SEQ ID NO:66) or its
complement under low stringency conditions,

(xxii) Tsa1 (SEQ ID NO:67), or a mammalian homolog thereof, or a variant of
Tsa1 encoded by a nucleic acid that hybridizes to the Tsa1 nucleic acid (SEQ ID NO:68)
or its complement under low stringency conditions,

(xxiii) Tye7 (SEQ ID NO:69), or a mammalian homolog thereof, or a variant of
Tye7 encoded by a nucleic acid that hybridizes to the Tye7 nucleic acid (SEQ ID NO:70)
or its complement under low stringency conditions,

(xxiv) Vid24 (SEQ ID NO:71), or a mammalian homolog thereof, or a variant of 
Vid24 encoded by a nucleic acid that hybridizes to the Vid24 nucleic acid (SEQ ID 
NO:72) or its complement under low stringency conditions, 

(xxv) Vps53 (SEQ ID NO:73), or a mammalian homolog thereof, or a variant of 
Vps53 encoded by a nucleic acid that hybridizes to the Vps53 nucleic acid (SEQ ID 
NO:74) or its complement under low stringency conditions, 

(xxvi) Ycl046w (SEQ ID NO:79), or a mammalian homolog thereof, or a variant of 
Ycl046w encoded by a nucleic acid that hybridizes to the Ycl046w nucleic acid (SEQ ID 
NO:80) or its complement under low stringency conditions, 

(xxvii) Ygr156w (SEQ ID NO:81), or a mammalian homolog thereof, or a variant of 
Ygr156w encoded by a nucleic acid that hybridizes to the Ygr156w nucleic acid (SEQ ID 
NO:82) or its complement under low stringency conditions, 

(xxviii) Yhl035c (SEQ ID NO:83), or a mammalian homolog thereof, or a variant of 
Yhl035c encoded by a nucleic acid that hybridizes to the Yhl035c nucleic acid (SEQ ID 
NO:84) or its complement under low stringency conditions, 

(xxix) Ykl018w (SEQ ID NO:85), or a mammalian homolog thereof, or a variant of 
Ykl018w encoded by a nucleic acid that hybridizes to the Ykl018w nucleic acid (SEQ ID 
NO:86) or its complement under low stringency conditions, 

(xxx) Ylr221c (SEQ ID NO:87), or a mammalian homolog thereof, or a variant of 
Ylr221c encoded by a nucleic acid that hybridizes to the Ylr221c nucleic acid (SEQ ID 
NO:88) or its complement under low stringency conditions, 

(xxxi) Yml030w (SEQ ID NO:91), or a mammalian homolog thereof, or a variant of 
Yml030w encoded by a nucleic acid that hybridizes to the Yml030w nucleic acid (SEQ ID 
NO:92) or its complement under low stringency conditions, and 

(xxxii) Yor179c (SEQ ID NO:93), or a mammalian homolog thereof, or a variant of 
Yor179c encoded by a nucleic acid that hybridizes to the Yor179c nucleic acid (SEQ ID 
NO:94) or its complement under low stringency conditions, 

wherein said first protein and said second protein are members of a native cellular
Polyadenylation-complex, and wherein said low stringency conditions comprise
hybridization in a buffer comprising 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5),
5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml denatured salmon sperm
DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at 40°C, washing in a buffer
consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5
hours at 55°C, and washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5
mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.

31. A kit comprising in a container the isolated complex of any one of claims 1-8 or the
antibody of claim 9.

32. A kit for processing RNA comprising in a container the isolated complex of any of
claims 1-8 optionally together with further components such as reagents and working
instructions.

33. A kit for the diagnosis of a disease of mammals, preferentially for a disease or
disorder such as infectious diseases; viral infections such as herpes simplex infections,
Epstein-Barr infections, influenza; metabolic disease such as metachromatic
leukodystrophy; neurodegenerative disorders such as amyotrophic lateral sclerosis or
cancer, comprising a product according to any of the claims 1-8 optionally together with
further components such as reagents and working instructions.

34. The complex of any one of claims 1-8, or the antibody or fragment of claim 9, for use
in a method of diagnosing a disease or disorder such as infectious diseases; viral
infections such as herpes simplex infections, Epstein-Barr infections, influenza;
metabolic disease such as metachromatic leukodystrophy; neurodegenerative disorders
such as amyotrophic lateral sclerosis; cancer.

35. A method for the production of a pharmaceutical composition comprising carrying out 
the method of claim 12 or 19 to identify a molecule that modulates the function, activity, 
composition or formation of said complex, and further comprising mixing the identified 
molecule with a pharmaceutically acceptable carrier. 

36. A process for preparing the complex of claims 1-8 and optionally the components thereof
comprising the following steps:

expressing such a protein in a target cell, 

isolating the protein complex which is attached to the tagged protein, and 
optionally disassociating the protein complex and isolating the individual complex 
members. 

37. The process according to claim 36 characterized in that the tagged protein comprises 
two different tags which allow two separate affinity purification steps. 

38. The process according to any of claims 36-37 characterized in that two tags are
separated by a cleavage site for a protease.

39. Component of the Polyadenylation-complex obtainable by a process according to
any of claims 36-38.

40. Complex of claims 1-8 and/or protein thereof as a target for an active agent of a
pharmaceutical, preferably a drug target in the treatment or prevention of a disease or
disorder such as infectious diseases; viral infections such as herpes simplex infections,
Epstein-Barr infections, influenza; metabolic disease such as metachromatic
leukodystrophy; neurodegenerative disorders such as amyotrophic lateral sclerosis;
cancer.

41. Component of the Polyadenylation-complex selected from 

a) yeast proteins 

(i) Ycl046w (SEQ ID NO:59), 

(ii) Ygr156w (SEQ ID NO:61), 

(iii) Yhl035c (SEQ ID NO:63), 

(iv) Ykl018w (SEQ ID NO:179), 

(v) Ylr221c (SEQ ID NO:67), 

(vi) Yml030w (SEQ ID NO:69), and 

(vii) Yor179c (SEQ ID NO:71). 

b) the mammalian homologs/orthologs of the proteins of (a), and 

c) a functionally active fragment or functionally active derivative of the proteins
according to (a) and (b) carrying one or more amino acid substitutions, deletions and/or
additions.

42. Component as described in claim 41, characterized in that it is encoded by a nucleic
acid sequence which hybridizes to a nucleic acid sequence encoding any of the yeast
proteins listed in claim 41 under low stringency conditions, wherein said low stringency
conditions comprise hybridization in a buffer comprising 35% formamide, 5X SSC, 50
mM Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 µg/ml
denatured salmon sperm DNA, and 10% (wt/vol) dextran sulfate for 18-20 hours at 40°C,
washing in a buffer consisting of 2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and
0.1% SDS for 1.5 hours at 55°C, and washing in a buffer consisting of 2X SSC, 25 mM
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS for 1.5 hours at 60°C.

43. Nucleic acid encoding a component according to any of claims 41 and 42.

44. Construct, preferably a vector construct, comprising 

(a) a nucleic acid according to claim 41 and at least one further nucleic acid which is
normally not associated with the nucleic acid according to claim 43, 

or 

(b) at least two separate nucleic acid sequences each encoding a different protein, or a 
functionally active fragment or a functionally active derivative thereof at least one of said 
proteins, or functionally active fragments or functionally active derivative thereof selected 
from the first group of proteins according to claim 1 (a) and at least one of said proteins, 
or functionally active fragments or functionally active derivative thereof selected from the 
second group of proteins according to claim 1 (b). 

45. Host cell containing a nucleic acid of claim 43 and/or a construct of claim 44 or
containing several vectors comprising on different vectors the nucleic acid sequence 
encoding at least one of the proteins, or functionally active fragments or functionally 
active derivatives thereof selected from the first group of proteins according to claim 1(a) 
and at least one of the proteins, or functionally active fragments or functionally active 
derivatives thereof selected from the second group of proteins according to claim 1(b). 

<110> Cellzome GmbH

<120> Cleavage and Polyadenylation Complex of Precursor mRNA

<130> PCT1619-031/tc

<140>

<141>

<150> EP 01111774.4
<151> 2001-05-15

<150> EP 01130253.6
<151> 2001-12-20

<160> 94

<170> PatentIn version 3.1

<210> 1
<211> 375
<212> PRT
<213> Saccharomyces cerevisiae
<400> 1

Met Asp Ser Glu Val Ala Ala Leu Val Ile Asp Asn Gly Ser Gly Met
1 5 10 15

Cys Lys Ala Gly Phe Ala Gly Asp Asp Ala Pro Arg Ala Val Phe Pro
20 25 30

Ser Ile Val Gly Arg Pro Arg His Gln Gly Ile Met Val Gly Met Gly
35 40 45

Gln Lys Asp Ser Tyr Val Gly Asp Glu Ala Gln Ser Lys Arg Gly Ile
50 55 60

Leu Thr Leu Arg Tyr Pro Ile Glu His Gly Ile Val Thr Asn Trp Asp
65 70 75 80

Asp Met Glu Lys Ile Trp His His Thr Phe Tyr Asn Glu Leu Arg Val
85 90 95

Ala Pro Glu Glu His Pro Val Leu Leu Thr Glu Ala Pro Met Asn Pro
100 105 110

Lys Ser Asn Arg Glu Lys Met Thr Gln Ile Met Phe Glu Thr Phe Asn
115 120 125

Val Pro Ala Phe Tyr Val Ser Ile Gln Ala Val Leu Ser Leu Tyr Ser
130 135 140

Ser Gly Arg Thr Thr Gly Ile Val Leu Asp Ser Gly Asp Gly Val Thr
145 150 155 160

His Val Val Pro Ile Tyr Ala Gly Phe Ser Leu Pro His Ala Ile Leu
165 170 175

Arg Ile Asp Leu Ala Gly Arg Asp Leu Thr Asp Tyr Leu Met Lys Ile
180 185 190

Leu Ser Glu Arg Gly Tyr Ser Phe Ser Thr Thr Ala Glu Arg Glu Ile
195 200 205

Val Arg Asp Ile Lys Glu Lys Leu Cys Tyr Val Ala Leu Asp Phe Glu
210 215 220

Gln Glu Met Gln Thr Ala Ala Gln Ser Ser Ser Ile Glu Lys Ser Tyr
225 230 235 240

Glu Leu Pro Asp Gly Gln Val Ile Thr Ile Gly Asn Glu Arg Phe Arg
245 250 255

Ala Pro Glu Ala Leu Phe His Pro Ser Val Leu Gly Leu Glu Ser Ala
260 265 270

Gly Ile Asp Gln Thr Thr Tyr Asn Ser Ile Met Lys Cys Asp Val Asp
275 280 285

Val Arg Lys Glu Leu Tyr Gly Asn Ile Val Met Ser Gly Gly Thr Thr
290 295 300

Met Phe Pro Gly Ile Ala Glu Arg Met Gln Lys Glu Ile Thr Ala Leu
305 310 315 320

Ala Pro Ser Ser Met Lys Val Lys Ile Ile Ala Pro Pro Glu Arg Lys
325 330 335

Tyr Ser Val Trp Ile Gly Gly Ser Ile Leu Ala Ser Leu Thr Thr Phe
340 345 350

Gln Gln Met Trp Ile Ser Lys Gln Glu Tyr Asp Glu Ser Gly Pro Ser
355 360 365

Ile Val His His Lys Cys Phe
370 375



<210> 2
<211> 1125
<212> DNA
<213> Saccharomyces cerevisiae
<400> 2



atggattctg gggttgctgc tttggttatt gataacggtt ctggtatgtg taaagccggt 60
tttgccggtg acgacgctcc tcgtgctgtc ttcccatcta tcgtcggtag accaagacac 120
caaggtatca tggtcggtat gggtcaaaaa gactcctacg ttggtgatga agctcaatcc 180
aagagaggta tcttgacttt acgttaccca attgaacacg gtattgtcac caactgggac 240
gatatggaaa agatctggca tcataccttc tacaacgaat tgagagttgc cccagaagaa 300
caccctgttc ttttgactga agctccaatg aaccctaaat caaacagaga aaagatgact 360
caaattatgt ttgaaacttt caacgttcca gccttctacg tttccatcca agccgttttg 420
tccttgtact cttccggtag aactactggt attgttttgg attccggtga tggtgttact 480
cacgtcgttc caatttacgc tggtttctct ctacctcacg ccattttgag aatcgatttg 540
gccggtagag atttgactga ctacttgatg aagatcttga gtgaacgtgg ttactctttc 600
tccaccactg ctgaaagaga aattgtccgt gacatcaagg aaaaactatg ttacgtcgcc 660
ttggacttcg aacaagaaat gcaaaccgct gctcaatctt cttcaattga aaaatcctac 720
gaacttccag atggtcaagt catcactatt ggtaacgaaa gattcagagc cccagaagct 780
ttgttccatc cttctgtttt gggtttggaa tctgccggta ttgaccaaac tacttacaac 840
tccatcatga agtgtgatgt cgatgtccgt aaggaattat acggtaacat cgttatgtcc 900
ggtggtacca ccatgttccc aggtattgcc gaaagaatgc aaaaggaaat caccgctttg 960
gctccatctt ccatgaaggt caagatcatt gctcctccag aaagaaagta ctccgtctgg 1020
attggtggtt ctatcttggc ttctttgact accttccaac aaatgtggat ctcaaaacaa 1080
gaatacgacg aaagtggtcc atctatcgtt caccacaagt gtttc 1125



<210> 3 

<211> 1357 

<212> PRT 

<213> Saccharomyces cerevisiae
<400> 3 

Met Asn Val Tyr Asp Asp Val Leu Asp Ala Thr Val Val Ser His Ser
1 5 10 15

Leu Ala Thr His Phe Thr Thr Ser Asp Tyr Glu Glu Leu Leu Val Val
20 25 30

Arg Thr Asn Ile Leu Ser Val Tyr Arg Pro Thr Arg Asp Gly Lys Leu
35 40 45

Tyr Leu Thr Asp Glu Phe Lys Phe His Gly Leu Ile Thr Asp Ile Gly
50 55 60

Leu Ile Pro Gln Lys Asp Ser Pro Leu Ser Cys Leu Leu Leu Cys Thr
65 70 75 80

Gly Val Ala Lys Ile Ser Ile Leu Lys Phe Asn Thr Leu Thr Asn Ser
85 90 95

Ile Asp Thr Leu Ser Leu His Tyr Tyr Glu Gly Lys Phe Lys Gly Lys
100 105 110

Ser Leu Val Glu Leu Ala Lys Ile Ser Thr Leu Arg Met Asp Pro Gly
115 120 125

Ser Ser Cys Ala Leu Leu Phe Asn Asn Asp Ile Ile Ala Phe Leu Pro
130 135 140

Phe His Val Asn Lys Asn Asp Asp Asp Glu Glu Glu Glu Asp Glu Asp
145 150 155 160

Glu Asn Ile Asp Asp Ser Glu Leu Ile His Ser Met Asn Gln Lys Ser
165 170 175

Gln Gly Thr Asn Thr Phe Asn Lys Arg Lys Arg Thr Lys Leu Gly Asp
180 185 190

Lys Phe Thr Ala Pro Ser Val Val Leu Val Ala Ser Glu Leu Tyr Glu
195 200 205

Gly Ala Lys Asn Ile Ile Asp Ile Gln Phe Leu Lys Asn Phe Thr Lys
210 215 220

Pro Thr Ile Ala Leu Leu Tyr Gln Pro Lys Leu Val Trp Ala Gly Asn
225 230 235 240

Thr Thr Ile Ser Lys Leu Pro Thr Gln Tyr Val Ile Leu Thr Leu Asn
245 250 255

Ile Gln Pro Ala Glu Ser Ala Thr Lys Ile Glu Ser Thr Thr Ile Ala
260 265 270

Phe Val Lys Glu Leu Pro Trp Asp Leu His Thr Ile Val Pro Val Ser
275 280 285

Asn Gly Ala Ile Ile Val Gly Thr Asn Glu Leu Ala Phe Leu Asp Asn
290 295 300

Thr Gly Val Leu Gln Ser Thr Val Leu Leu Asn Ser Phe Ala Asp Lys
305 310 315 320

Glu Leu Gln Lys Thr Lys Ile Ile Asn Asn Ser Ser Leu Glu Ile Met
325 330 335

Phe Arg Glu Lys Asn Thr Thr Ser Ile Trp Ile Pro Ser Ser Lys Ser
340 345 350

Lys Asn Gly Gly Ser Asn Asn Asp Glu Thr Leu Leu Leu Met Asp Leu
355 360 365

Lys Ser Asn Ile Tyr Tyr Ile Gln Met Glu Ala Glu Gly Arg Leu Leu
370 375 380

Ile Lys Phe Asp Ile Phe Lys Leu Pro Ile Val Asn Asp Leu Leu Lys
385 390 395 400

Glu Asn Ser Asn Pro Lys Cys Ile Thr Arg Leu Asn Ala Thr Asn Ser
405 410 415

Asn Lys Asn Met Asp Leu Phe Ile Gly Phe Gly Ser Gly Asn Ala Leu
420 425 430

Val Leu Arg Leu Asn Asn Leu Lys Ser Thr Ile Glu Thr Arg Glu Ala
435 440 445

His Asn Pro Ser Ser Gly Thr Asn Ser Leu Met Asp Ile Asn Asp Asp
450 455 460

Asp Asp Glu Glu Met Asp Asp Leu Tyr Ala Asp Glu Ala Pro Glu Asn
465 470 475 480

Gly Leu Thr Thr Asn Asp Ser Lys Gly Thr Val Glu Thr Val Gln Pro
485 490 495

Phe Asp Ile Glu Leu Leu Ser Ser Leu Arg Asn Val Gly Pro Ile Thr
500 505 510

Ser Leu Thr Val Gly Lys Val Ser Ser Ile Asp Asp Val Val Lys Gly
515 520 525

Leu Pro Asn Pro Asn Lys Asn Glu Tyr Ser Leu Val Ala Thr Ser Gly
530 535 540

Asn Gly Ser Gly Ser His Leu Thr Val Ile Gln Thr Ser Val Gln Pro
545 550 555 560

Glu Ile Glu Leu Ala Leu Lys Phe Ile Ser Ile Thr Gln Ile Trp Asn
565 570 575

Leu Lys Ile Lys Gly Arg Asp Arg Tyr Leu Ile Thr Thr Asp Ser Thr
580 585 590

Lys Ser Arg Ser Asp Ile Tyr Glu Ser Asp Asn Asn Phe Lys Leu His
595 600 605

Lys Gly Gly Arg Leu Arg Arg Asp Ala Thr Thr Val Tyr Ile Ser Met
610 615 620

Phe Gly Glu Glu Lys Arg Ile Ile Gln Val Thr Thr Asn His Leu Tyr
625 630 635 640

Leu Tyr Asp Thr His Phe Arg Arg Leu Thr Thr Ile Lys Phe Asp Tyr
645 650 655

Glu Val Ile His Val Ser Val Met Asp Pro Tyr Ile Leu Val Thr Val
660 665 670

Ser Arg Gly Asp Ile Lys Ile Phe Glu Leu Glu Glu Lys Asn Lys Arg
675 680 685

Lys Leu Leu Lys Val Asp Leu Pro Glu Ile Leu Asn Glu Met Val Ile
690 695 700

Thr Ser Gly Leu Ile Leu Lys Ser Asn Met Cys Asn Glu Phe Leu Ile
705 710 715 720

Gly Leu Ser Lys Ser Gln Glu Glu Gln Leu Leu Phe Thr Phe Val Thr
725 730 735

Ala Asp Asn Gln Ile Ile Phe Phe Thr Lys Asp His Asn Asp Arg Ile
740 745 750

Phe Gln Leu Asn Gly Val Asp Gln Leu Asn Glu Ser Leu Tyr Ile Ser
755 760 765

Thr Tyr Gln Leu Gly Asp Glu Ile Val Pro Asp Pro Ser Ile Lys Gln
770 775 780

Val Met Ile Asn Lys Leu Gly His Asp Asn Lys Glu Glu Tyr Leu Thr
785 790 795 800

Ile Leu Thr Phe Gly Gly Glu Ile Tyr Gln Tyr Arg Lys Leu Pro Gln
805 810 815

Arg Arg Ser Arg Phe Tyr Arg Asn Val Thr Arg Asn Asp Leu Ala Ile
820 825 830

Thr Gly Ala Pro Asp Asn Ala Tyr Ala Lys Gly Val Ser Ser Ile Glu
835 840 845

Arg Ile Met His Tyr Phe Pro Asp Tyr Asn Gly Tyr Ser Val Ile Phe
850 855 860

Val Thr Gly Ser Val Pro Tyr Ile Leu Ile Lys Glu Asp Asp Ser Thr
865 870 875 880

Pro Lys Ile Phe Lys Phe Gly Asn Ile Pro Leu Val Ser Val Thr Pro
885 890 895

Trp Ser Glu Arg Ser Val Met Cys Val Asp Asp Ile Lys Asn Ala Arg
900 905 910

Val Tyr Thr Leu Thr Thr Asp Asn Met Tyr Tyr Gly Asn Lys Leu Pro
915 920 925

Leu Lys Gln Ile Lys Ile Ser Asn Val Leu Asp Asp Tyr Lys Thr Leu
930 935 940

Gln Lys Leu Val Tyr His Glu Arg Ala Gln Leu Phe Leu Val Ser Tyr
945 950 955 960

Cys Lys Arg Val Pro Tyr Glu Ala Leu Gly Glu Asp Gly Glu Lys Val
965 970 975

Ile Gly Tyr Asp Glu Asn Val Pro His Ala Glu Gly Phe Gln Ser Gly
980 985 990

Ile Leu Leu Ile Asn Pro Lys Ser Trp Lys Val Ile Asp Lys Ile Asp
995 1000 1005

Phe Pro Lys Asn Ser Val Val Asn Glu Met Arg Ser Ser Met Ile
1010 1015 1020

Gln Ile Asn Ser Lys Thr Lys Arg Lys Arg Glu Tyr Ile Ile Ala
1025 1030 1035

Gly Val Ala Asn Ala Thr Thr Glu Asp Thr Pro Pro Thr Gly Ala
1040 1045 1050

Phe His Ile Tyr Asp Val Ile Glu Val Val Pro Glu Pro Gly Lys
1055 1060 1065

Pro Asp Thr Asn Tyr Lys Leu Lys Glu Ile Phe Gln Glu Glu Val
1070 1075 1080

Ser Gly Thr Val Ser Thr Val Cys Glu Val Ser Gly Arg Phe Met
1085 1090 1095

Ile Ser Gln Ser Gln Lys Val Leu Val Arg Asp Ile Gln Glu Asp
1100 1105 1110

Asn Ser Val Ile Pro Val Ala Phe Leu Asp Ile Pro Val Phe Val
1115 1120 1125

Thr Asp Ser Lys Ser Phe Gly Asn Leu Leu Ile Ile Gly Asp Ala
1130 1135 1140

Met Gln Gly Phe Gln Phe Ile Gly Phe Asp Ala Glu Pro Tyr Arg
1145 1150 1155

Met Ile Ser Leu Gly Arg Ser Met Ser Lys Phe Gln Thr Met Ser
1160 1165 1170

Leu Glu Phe Leu Val Asn Gly Gly Asp Met Tyr Phe Ala Ala Thr
1175 1180 1185

Asp Ala Asp Arg Asn Val His Val Leu Lys Tyr Ala Pro Asp Glu
1190 1195 1200

Pro Asn Ser Leu Ser Gly Gln Arg Leu Val His Cys Ser Ser Phe
1205 1210 1215

Thr Leu His Ser Thr Asn Ser Cys Met Met Leu Leu Pro Arg Asn
1220 1225 1230

Glu Glu Phe Gly Ser Pro Gln Val Pro Ser Phe Gln Asn Val Gly
1235 1240 1245

Gly Gln Val Asp Gly Ser Val Phe Lys Ile Val Pro Leu Ser Glu
1250 1255 1260

Glu Lys Tyr Arg Arg Leu Tyr Val Ile Gln Gln Gln Ile Ile Asp
1265 1270 1275

Arg Glu Leu Gln Leu Gly Gly Leu Asn Pro Arg Met Glu Arg Leu
1280 1285 1290

Ala Asn Asp Phe Tyr Gln Met Gly His Ser Met Arg Pro Met Leu
1295 1300 1305

Asp Phe Asn Val Ile Arg Arg Phe Cys Gly Leu Ala Ile Asp Arg
1310 1315 1320

Arg Lys Ser Ile Ala Gln Lys Ala Gly Arg His Ala His Phe Glu
1325 1330 1335

Ala Trp Arg Asp Ile Ile Asn Ile Glu Phe Ser Met Arg Ser Leu
1340 1345 1350

Cys Gln Gly Lys
1355

<210> 4
<211> 2000
<212> DNA
<213> Saccharomyces cerevisiae
<400> 4

atgaatgtat atgatgatgt gcttgacgct actgttgtgt ctcattcatt agcaacacat 60
tttactactt cagattatga ggagctctta gtggtgcgaa caaatatcct ttctgtctac 120
aggcctacta gggatggtaa actgtatttg actgatgagt ttaagtttca tggcttgatc 180
accgatatag gcctcattcc ccaaaaagac agtccgttaa gttgcctgct tttatgcaca 240
ggcgtcgcta aaatctctat tttaaagttc aacacgctaa caaactcaat cgatacccta 300
agcctgcatt actacgaagg aaagttcaag ggtaaatcgt tggtagagct ggccaaaatt 360
tccaccttga gaatggatcc tggaagttct tgtgccttac tgttcaataa tgacattatc 420
gcttttctac ccttccatgt caataaaaat gacgacgatg aggaggaaga agatgaggac 480
gagaacatag acgatagtga attaatccat agcatgaatc aaaaatccca agggacgaat 540
actttcaata agaggaagag aaccaagtta ggtgacaagt tcacggctcc aagcgtggta 600
ttggtagcga gtgagttgta cgaaggcgcc aaaaatatca tagatattca gttcttgaaa 660
aattttacta aaccaacgat agcgctctta taccaaccaa agcttgtttg ggcaggaaac 720
accaccattt caaaactccc tacacaatac gttatactca cattaaatat ccagcctgca 780
gaaagtgcca ccaagattga atcaaccaca atagcatttg tcaaggaact gccttgggat 840
ctacacacca ttgtacctgt ttcgaatggt gctatcattg tgggaaccaa tgaactggca 900
tttctagata atactggcgt tttacaatcg acagtattat taaactcatt tgctgataaa 960
gagttacaga aaacaaaaat tatcaacaat tcttcattgg aaattatgtt cagggagaaa 1020
aacactacgt caatttggat accttcgtcc aagagcaaaa acggaggaag taataatgac 1080
gaaacactac tacttatgga cttgaaatcc aatatatact atatccaaat ggaagcagag 1140
ggcaggttac tgattaaatt cgatattttc aagctaccta tagtcaatga tcttttgaag 1200
gagaactcca atccaaaatg tataacacgt ttgaatgcca ccaattcaaa caaaaatatg 1260
gacttattta ttgggtttgg ttcaggaaat gctttggttc tcagattgaa taatttgaaa 1320
tctaccattg aaacaagaga agcacacaat ccatcttcag gtacaaatag tttgatggat 1380
atcaatgacg acgatgatga ggaaatggac gatttatacg ccgatgaagc tcctgaaaat 1440
ggactaacaa ctaatgattc caaaggcact gtggaaactg ttcaaccttt cgatatcgaa 1500
ctgttatcat ctctaaggaa tgttggtccc atcacatcgt taactgtagg taaggtatct 1560
tccattgatg atgtggtaaa aggactacca aatccaaaca aaaacgagta ttcgctagtt 1620
gccacatcgg gaaatggttc aggttctcat ttgaccgtca tacagactag tgttcaaccg 1680
gaaattgaat tagcattgaa atttatcagt ataacacaaa tctggaatct gaagatcaaa 1740
ggaagagata ggtacttaat aactactgat tccacaaaat cccggagtga catatatgaa 1800
agtgacaata acttcaaact tcataagggc ggccgtttaa gaagagatgc tactacagtt 1860
tacatttcaa tgtttggtga agaaaaaaga ataatacaag tcaccaccaa tcacttatat 1920
ttatatgata cacattttag acgtctcacc acaattaaat ttgactacga agttattcat 1980
gtttccgtca tggatccgta 2000



<210> 5
<211> 859
<212> PRT
<213> Saccharomyces cerevisiae
<400> 5

Met Thr Tyr Lys Tyr Asn Cys Cys Asp Asp Gly Ser Gly Thr Thr Val
1 5 10 15

Gly Ser Val Val Arg Phe Asp Asn Val Thr Leu Leu Ile Asp Pro Gly
20 25 30

Trp Asn Pro Ser Lys Val Ser Tyr Glu Gln Cys Ile Lys Tyr Trp Glu
35 40 45

Lys Val Ile Pro Glu Ile Asp Val Ile Ile Leu Ser Gln Pro Thr Ile
50 55 60

Glu Cys Leu Gly Ala His Ser Leu Leu Tyr Tyr Asn Phe Thr Ser His
65 70 75 80

Phe Ile Ser Arg Ile Gln Val Tyr Ala Thr Leu Pro Val Ile Asn Leu
85 90 95

Gly Arg Val Ser Thr Ile Asp Ser Tyr Ala Ser Ala Gly Val Ile Gly
100 105 110

Pro Tyr Asp Thr Asn Lys Leu Asp Leu Glu Asp Ile Glu Ile Ser Phe
115 120 125

Asp His Ile Val Pro Leu Lys Tyr Ser Gln Leu Val Asp Leu Arg Ser
130 135 140

Arg Tyr Asp Gly Leu Thr Leu Leu Ala Tyr Asn Ala Gly Val Cys Pro
145 150 155 160

Gly Gly Ser Ile Trp Cys Ile Ser Thr Tyr Ser Glu Lys Leu Val Tyr
165 170 175

Ala Lys Arg Trp Asn His Thr Arg Asp Asn Ile Leu Asn Ala Ala Ser
180 185 190

Ile Leu Asp Ala Thr Gly Lys Pro Leu Ser Thr Leu Met Arg Pro Ser
195 200 205

Ala Ile Ile Thr Thr Leu Asp Arg Phe Gly Ser Ser Gln Pro Phe Lys
210 215 220

Lys Arg Ser Lys Ile Phe Lys Asp Thr Leu Lys Lys Gly Leu Ser Ser
225 230 235 240

Asp Gly Ser Val Ile Ile Pro Val Asp Met Ser Gly Lys Phe Leu Asp
245 250 255

Leu Phe Thr Gln Val His Glu Leu Leu Phe Glu Ser Thr Lys Ile Asn
260 265 270

Ala His Thr Gln Val Pro Val Leu Ile Leu Ser Tyr Ala Arg Gly Arg
275 280 285

Thr Leu Thr Tyr Ala Lys Ser Met Leu Glu Trp Leu Ser Pro Ser Leu
290 295 300

Leu Lys Thr Trp Glu Asn Arg Asn Asn Thr Ser Pro Phe Glu Ile Gly
305 310 315 320

Ser Arg Ile Lys Ile Ile Ala Pro Asn Glu Leu Ser Lys Tyr Pro Gly
325 330 335

Ser Lys Ile Cys Phe Val Ser Glu Val Gly Ala Leu Ile Asn Glu Val
340 345 350

Ile Ile Lys Val Gly Asn Ser Glu Lys Thr Thr Leu Ile Leu Thr Lys
355 360 365

Pro Ser Phe Glu Cys Ala Ser Ser Leu Asp Lys Ile Leu Glu Ile Val
370 375 380

Glu Gln Asp Glu Arg Asn Trp Lys Thr Phe Pro Glu Asp Gly Lys Ser
385 390 395 400

Phe Leu Cys Asp Asn Tyr Ile Ser Ile Asp Thr Ile Lys Glu Glu Pro
405 410 415

Leu Ser Lys Glu Glu Thr Glu Ala Phe Lys Val Gln Leu Lys Glu Lys
420 425 430

Lys Arg Asp Arg Asn Lys Lys Ile Leu Leu Val Lys Arg Glu Ser Lys
435 440 445

Lys Leu Ala Asn Gly Asn Ala Ile Ile Asp Asp Thr Asn Gly Glu Arg
450 455 460

Ala Met Arg Asn Gln Asp Ile Leu Val Glu Asn Val Asn Gly Val Pro
465 470 475 480

Pro Ile Asp His Ile Met Gly Gly Asp Glu Asp Asp Asp Glu Glu Glu
485 490 495

Glu Asn Asp Asn Leu Leu Asn Leu Leu Lys Asp Asn Ser Glu Lys Ser
500 505 510

Ala Ala Lys Lys Asn Thr Glu Val Pro Val Asp Ile Ile Ile Gln Pro
515 520 525

Ser Ala Ala Ser Lys His Lys Met Phe Pro Phe Asn Pro Ala Lys Ile
530 535 540

Lys Lys Asp Asp Tyr Gly Thr Val Val Asp Phe Thr Met Phe Leu Pro
545 550 555 560

Asp Asp Ser Asp Asn Val Asn Gln Asn Ser Arg Lys Arg Pro Leu Lys
565 570 575

Asp Gly Ala Lys Thr Thr Ser Pro Val Asn Glu Glu Asp Asn Lys Asn
580 585 590

Glu Glu Glu Asp Gly Tyr Asn Met Ser Asp Pro Ile Ser Lys Arg Ser
595 600 605

Lys His Arg Ala Ser Arg Tyr Ser Gly Phe Ser Gly Thr Gly Glu Ala
610 615 620

Glu Asn Phe Asp Asn Leu Asp Tyr Leu Lys Ile Asp Lys Thr Leu Ser
625 630 635 640

Lys Arg Thr Ile Ser Thr Val Asn Val Gln Leu Lys Cys Ser Val Val
645 650 655

Ile Leu Asn Leu Gln Ser Leu Val Asp Gln Arg Ser Ala Ser Ile Ile
660 665 670

Trp Pro Ser Leu Lys Ser Arg Lys Ile Val Leu Ser Ala Pro Lys Gln
675 680 685

Ile Gln Asn Glu Glu Ile Thr Ala Lys Leu Ile Lys Lys Asn Ile Glu
690 695 700

Val Val Asn Met Pro Leu Asn Lys Ile Val Glu Phe Ser Thr Thr Ile
705 710 715 720

Lys Thr Leu Asp Ile Ser Ile Asp Ser Asn Leu Asp Asn Leu Leu Lys
725 730 735

Trp Gln Arg Ile Ser Asp Ser Tyr Thr Val Ala Thr Val Val Gly Arg
740 745 750

Leu Val Lys Glu Ser Leu Pro Gln Val Asn Asn His Gln Lys Thr Ala
755 760 765

Ser Arg Ser Lys Leu Val Leu Lys Pro Leu His Gly Ser Ser Arg Ser
770 775 780

His Lys Thr Gly Ala Leu Ser Ile Gly Asp Val Arg Leu Ala Gln Leu
785 790 795 800

Lys Lys Leu Leu Thr Glu Lys Asn Tyr Ile Ala Glu Phe Lys Gly Glu
805 810 815

Gly Thr Leu Val Ile Asn Glu Lys Val Ala Val Arg Lys Ile Asn Asp
820 825 830

Ala Glu Thr Ile Ile Asp Gly Thr Pro Ser Glu Leu Phe Asp Thr Val
835 840 845

Lys Lys Leu Val Thr Asp Met Leu Ala Lys Ile
850 855

<210> 6
<211> 2000
<212> DNA
<213> Saccharomyces cerevisiae
<400> 6

atgacttata aatacaattg ctgtgatgat ggatcaggaa ccaccgttgg ttcagtggta 60
cgatttgata acgtcacctt gttaatcgat cccggttgga atccatctaa agtatcgtat 120
gagcaatgca tcaagtactg ggaaaaagtg ataccggaaa ttgatgtaat aatactatca 180
caaccaacaa ttgaatgttt aggtgctcat tcccttctgt actataattt tacctcgcat 240
ttcatatcca gaattcaagt ttacgcaact cttcctgtta tcaacttagg gagagtttcc 300
accatagatt catatgcatc tgcaggtgtt ataggtccat atgacacaaa taaactcgat 360
cttgaagata ttgagatatc ttttgatcat attgtacctt tgaagtactc gcaattggta 420
gatctacgat caagatacga tggattgact ttactggcgt acaatgctgg tgtgtgtcca 480
ggtggttcta tttggtgcat tagcacatac tcagagaaat tggtttatgc aaagcgatgg 540
aaccacacta gagataacat actaaatgct gcttctatat tggacgccac tggtaagccc 600
ttatcaactt tgatgaggcc ctctgcaata attactactt tggacaggtt tggctcctct 660
caaccgttta aaaaacgttc aaaaatcttt aaagatacat taaagaaagg tttgagttcg 720
gatgggtccg ttataatacc ggtagatatg agtggcaaat ttctggacct gttcacacaa 780
gttcatgagt tattatttga aagcacaaag atcaatgcgc acacacaagt acctgtactt 840
attctgtctt acgcgagagg aagaactcta acatatgcca aatcaatgct cgagtggctg 900
tccccttcac tacttaagac atgggaaaac agaaataata cttctccatt tgaaattgga 960
tcacggatca aaatcatcgc accaaacgaa ttgagtaaat atcctggttc aaagatttgc 1020
ttcgtatctg aggtaggagc tttgattaat gaagtgataa ttaaagttgg taattctgag 1080
aagacaacct tgattttgac caagccgagc tttgaatgtg cgtcgtcttt agacaagata 1140
ctagaaattg ttgaacaaga tgaaagaaat tggaaaacat ttcctgaaga tggtaaatca 1200
ttcctctgcg ataattacat ttcaatagac accataaaag aagaaccctt gagtaaagag 1260
gaaacagagg cgtttaaagt acagcttaaa gaaaagaaaa gggacagaaa taagaaaatt 1320
ctactggtga aaagagagtc taagaaactg gctaacggca acgcaatcat agatgatacc 1380
aacggagaga gggcaatgag aaatcaagac attttagtgg aaaacgtcaa tggagtacca 1440
ccgattgatc atattatggg gggtgacgaa gatgatgatg aagaggaaga aaacgacaat 1500
ttgttgaatt tactcaagga taactctgag aaatcagcag cgaaaaaaaa tacggaagtt 1560
cctgtagaca ttataattca accaagtgcc gcttcaaaac ataaaatgtt cccatttaac 1620
cccgctaaaa tcaagaaaga tgattatggt actgttgtag attttacgat gtttttacct 1680
gatgattctg acaatgttaa tcaaaatagc aggaaaagac ctctcaaaga cggtgctaaa 1740
accacgagcc ctgtaaacga agaagataac aaaaatgaag aagaagatgg ttataatatg 1800
agtgatccaa ttagcaaaag gagcaaacac cgtgcttctc gttactcggg attttctgga 1860
accggagaag cagaaaattt tgataacctt gactatttaa agatagacaa aacgctctcc 1920
aaaagaacaa tctcaactgt aaatgtccaa ttgaaatgct ccgtagtaat cctaaattta 1980
cagagtttag ttgatcaaag 2000

<210> 7
<211> 372
<212> PRT
<213> Saccharomyces cerevisiae
<400> 7

Met Lys Cys Arg Val Trp Ser Glu Ala Arg Val Tyr Thr Asn Ile Asn
1 5 10 15

Lys Gln Arg Thr Glu Glu Tyr Trp Asp Tyr Glu Asn Thr Val Ile Asp
20 25 30

Trp Ser Thr Asn Thr Lys Asp Tyr Glu Ile Glu Asn Lys Val Gly Arg
35 40 45

Gly Lys Tyr Ser Glu Val Phe Gln Gly Val Lys Leu Asp Ser Lys Val
50 55 60

Lys Ile Val Ile Lys Met Leu Lys Pro Val Lys Lys Lys Lys Ile Lys
65 70 75 80

Arg Glu Ile Lys Ile Leu Thr Asp Leu Ser Asn Glu Lys Val Pro Pro
85 90 95

Thr Thr Leu Pro Phe Gln Lys Asp Gln Tyr Tyr Thr Asn Gln Lys Glu
100 105 110

Asp Val Leu Lys Phe Ile Arg Pro Tyr Ile Phe Asp Gln Pro His Asn
115 120 125

Gly His Ala Asn Ile Ile His Leu Phe Asp Ile Ile Lys Asp Pro Ile
130 135 140

Ser Lys Thr Pro Ala Leu Val Phe Glu Tyr Val Asp Asn Val Asp Phe
145 150 155 160

Arg Ile Leu Tyr Pro Lys Leu Thr Asp Leu Glu Ile Arg Phe Tyr Met
165 170 175

Phe Glu Leu Leu Lys Ala Leu Asp Tyr Cys His Ser Met Gly Ile Met
180 185 190

His Arg Asp Val Lys Pro His Asn Val Met Ile Asp His Lys Asn Lys
195 200 205

Lys Leu Arg Leu Ile Asp Trp Gly Leu Ala Glu Phe Tyr His Val Asn
210 215 220

Met Glu Tyr Asn Val Arg Val Ala Ser Arg Phe Phe Lys Gly Pro Glu
225 230 235 240

Leu Leu Val Asp Tyr Arg Met Tyr Asp Tyr Ser Leu Asp Leu Trp Ser
245 250 255

Phe Gly Thr Met Leu Ala Ser Met Ile Phe Lys Arg Glu Pro Phe Phe
260 265 270

His Gly Thr Ser Asn Thr Asp Gln Leu Val Lys Ile Val Lys Val Leu
275 280 285

Gly Thr Ser Asp Phe Glu Lys Tyr Leu Leu Lys Tyr Glu Ile Thr Leu
290 295 300

Pro Arg Glu Phe Tyr Asp Met Asp Gln Tyr Ile Arg Lys Pro Trp His
305 310 315 320

Arg Phe Ile Asn Asp Gly Asn Lys His Leu Ser Gly Asn Asp Glu Ile
325 330 335

Ile Asp Leu Ile Asp Asn Leu Leu Arg Tyr Asp His Gln Glu Arg Leu
340 345 350

Thr Ala Lys Glu Ala Met Gly His Pro Trp Phe Ala Pro Ile Arg Glu
355 360 365

Gln Ile Glu Lys
370



<210> 8
<211> 1119
<212> DNA
<213> Saccharomyces cerevisiae
<400> 8

atgaaatgca gggtatggtc agaggctcgt gtttatacga atatcaataa acaaagaacc 60
gaggaatatt gggattatga aaatactgta attgattggt ccacaaatac aaaggactat 120
gaaattgaaa ataaagttgg gcgaggaaaa tactccgagg ttttccaagg tgtcaaatta 180
gactctaaag ttaaaattgt tattaagatg ttgaaaccag ttaaaaagaa gaagatcaag 240
agagaaatca aaatcttaac ggatttgtct aacgaaaaag tgcctccaac gactttgcca 300
tttcaaaaag atcaatatta cacaaatcaa aaggaggacg ttttaaaatt catcaggccc 360
tatatctttg atcagccaca caatggtcat gcaaacataa ttcatctatt tgatataata 420
aaggatccca tctcaaaaac tccggctttg gtcttcgaat acgtagataa tgtggacttc 480
cgtattcttt accctaaatt aaccgatctc gaaattaggt tttatatgtt tgagttatta 540
aaggccttag actattgtca ttcaatggga ataatgcata gagatgttaa acctcataac 600
gtaatgattg atcataagaa taaaaaattg cgattgatag attgggggct tgctgaattt 660
tatcatgtta atatggaata caatgttcgt gttgcgtcga ggttttttaa gggtcctgaa 720
ctactagttg actacagaat gtatgattat tctttagact tgtggtcgtt tgggacaatg 780
ttggcttcta tgatctttaa aagagagcca tttttccatg gaacgagtaa cacagaccag 840
cttgtcaaga tcgtcaaagt acttggtaca agcgattttg agaaatacct gttgaagtat 900
gaaattacct taccgagaga attttacgat atggaccaat acatcagaaa gccttggcat 960
agattcatca atgatggtaa taaacattta agcggcaacg atgaaattat tgaccttatt 1020
gacaatcttt tgagatatga tcatcaagaa agattaactg ctaaggaagc gatgggacac 1080
ccgtggtttg ccccaataag ggaacaaatt gaaaaataa 1119



<210> 9
<211> 445
<212> PRT
<213> Saccharomyces cerevisiae
<400> 9

Met Ala Ser Leu Pro Gly Ile Asp Glu His Thr Thr Ser Glu Glu Leu
1 5 10 15

Ile Thr Gly Asp Asn Glu Trp His Lys Leu Val Ile Pro Lys Gly Ser
20 25 30

Asp Trp Gln Ile Asp Leu Lys Ala Glu Gly Lys Leu Ile Val Lys Val
35 40 45

Asn Ser Gly Ile Val Glu Ile Phe Gly Thr Glu Leu Ala Val Asp Asp
50 55 60

Glu Tyr Thr Phe Gln Asn Trp Lys Phe Pro Ile Tyr Ala Val Glu Glu
65 70 75 80

Thr Glu Leu Leu Trp Lys Cys Pro Asp Leu Thr Thr Asn Thr Ile Thr
85 90 95

Val Lys Pro Asn His Thr Met Lys Tyr Ile Tyr Asn Leu His Phe Met
100 105 110

Leu Glu Lys Ile Arg Met Ser Asn Phe Glu Gly Pro Arg Val Val Ile
115 120 125

Val Gly Gly Ser Gln Thr Gly Lys Thr Ser Leu Ser Arg Thr Leu Cys
130 135 140

Ser Tyr Ala Leu Lys Phe Asn Ala Tyr Gln Pro Leu Tyr Ile Asn Leu
145 150 155 160

Asp Pro Gln Gln Pro Ile Phe Thr Val Pro Gly Cys Ile Ser Ala Thr
165 170 175

Pro Ile Ser Asp Ile Leu Asp Ala Gln Leu Pro Thr Trp Gly Gln Ser
180 185 190

Leu Thr Ser Gly Ala Thr Leu Leu His Asn Lys Gln Pro Met Val Lys
195 200 205

Asn Phe Gly Leu Glu Arg Ile Asn Glu Asn Lys Asp Leu Tyr Leu Glu
210 215 220

Cys Ile Ser Gln Leu Gly Gln Val Val Gly Gln Arg Leu His Leu Asp
225 230 235 240

Pro Gln Val Arg Arg Ser Gly Cys Ile Val Asp Thr Pro Ser Ile Ser
245 250 255

Gln Leu Asp Glu Asn Leu Ala Glu Leu His His Ile Ile Glu Lys Leu
260 265 270

Asn Val Asn Ile Met Leu Val Leu Cys Ser Glu Thr Asp Pro Leu Trp
275 280 285

Glu Lys Val Lys Lys Thr Phe Gly Pro Glu Leu Gly Asn Asn Asn Ile
290 295 300

Phe Phe Ile Pro Lys Leu Asp Gly Val Ser Ala Val Asp Asp Val Tyr
305 310 315 320

Lys Arg Ser Leu Gln Arg Thr Ser Ile Arg Glu Tyr Phe Tyr Gly Ser
325 330 335

Leu Asp Thr Ala Leu Ser Pro Tyr Ala Ile Gly Val Asp Tyr Glu Asp
340 345 350

Leu Thr Ile Trp Lys Pro Ser Asn Val Phe Asp Asn Glu Val Gly Arg
355 360 365

Val Glu Leu Phe Pro Val Thr Ile Thr Pro Ser Asn Leu Gln His Ala
370 375 380

Ile Ile Ala Ile Thr Phe Ala Glu Arg Arg Ala Asp Gln Ala Thr Val
385 390 395 400

Ile Lys Ser Pro Ile Leu Gly Phe Ala Leu Ile Thr Glu Val Asn Glu
405 410 415

Lys Arg Arg Lys Leu Arg Val Leu Leu Pro Val Pro Gly Arg Leu Pro
420 425 430

Ser Lys Ala Met Ile Leu Thr Ser Tyr Arg Tyr Leu Glu
435 440 445



<210> 10
<211> 1338 

<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 10 

atggcaagtc tacctggtat tgatgagcat actacctctg aagaattgat aactggtgat 60 

aacgaatggc acaagttggt tattcccaag ggcagtgatt ggcaaattga tcttaaagca 120 

gagggaaaac tgatagtcaa ggtaaactcg ggcatagttg agatatttgg caccgagttg 180 

gcagtagatg atgagtacac ttttcagaac tggaagtttc ccatatacgc tgtcgaagaa 240 

acagaattat tatggaaatg tcctgattta actacgaata caataactgt caagcctaac 300 

catactatga aatatattta taatctacac ttcatgttgg agaagatacg aatgtctaac 360 

tttgaaggcc ctcgagtggt gattgttggt ggttcgcaaa ccgggaagac ttccctatcg 420 

aggacactat gttcttacgc tttgaaattc aacgcttacc aaccactata catcaatctt 480 

gaccctcaac agcccatttt tactgttcct ggatgcatat ctgccacccc aatatcagat 540 

atacttgatg cacaactacc cacttggggt cagagtctca ctagtggtgc cacactacta 600 

cataataagc agccaatggt gaaaaatttt ggcctggaaa ggattaatga gaacaaagat 660 

ctataccttg agtgtataag ccagttaggt caagtagtag gtcaaaggtt acatttggat 720 

cctcaagtca ggagatcagg gtgcattgtc gatacgccat caatatcaca actggatgaa 780

aatttggctg aactgcacca tatcattgag aaactcaatg ttaacattat gctagtacta 840

tgttctgaaa cggatcctct ttgggagaaa gtaaaaaaga catttggtcc agaattggga 900
aataataata tttttttcat tcctaaattg gatggtgtat ccgcagttga tgatgtatac 960 
aaaagatcct tgcagaggac atccataaga gagtactttt acggctctct tgatacagcc 1020 
ttgagtcctt atgctattgg tgttgattat gaggatttaa ctatttggaa acctagcaat 1080 
gttttcgata acgaggtcgg tagggtggaa ctgttccccg tcactataac tccaagtaac 1140 
ctacagcacg ctattatagc cataacgttc gcagaaagaa gggcagatca ggcaacagta 1200 
ataaaatcgc ctattttagg attcgctttg attacagaag ttaatgaaaa aaggcgtaaa 1260 
ttaagggttt tactacctgt cccaggccga cttcccagca aggcgatgat tctaacttca 1320 
tatagatatt tagagtaa 1338 



<210> 11
<211> 842
<212> PRT
<213> Saccharomyces cerevisiae
<400> 11

Met Val Ala Phe Thr Val Asp Gln Met Arg Ser Leu Met Asp Lys Val
1 5 10 15

Thr Asn Val Arg Asn Met Ser Val Ile Ala His Val Asp His Gly Lys
20 25 30

Ser Thr Leu Thr Asp Ser Leu Val Gln Arg Ala Gly Ile Ile Ser Ala
35 40 45

Ala Lys Ala Gly Glu Ala Arg Phe Thr Asp Thr Arg Lys Asp Glu Gln
50 55 60

Glu Arg Gly Ile Thr Ile Lys Ser Thr Ala Ile Ser Leu Tyr Ser Glu
65 70 75 80

Met Ser Asp Glu Asp Val Lys Glu Ile Lys Gln Lys Thr Asp Gly Asn
85 90 95

Ser Phe Leu Ile Asn Leu Ile Asp Ser Pro Gly His Val Asp Phe Ser
100 105 110

Ser Glu Val Thr Ala Ala Leu Arg Val Thr Asp Gly Ala Leu Val Val
115 120 125

Val Asp Thr Ile Glu Gly Val Cys Val Gln Thr Glu Thr Val Leu Arg
130 135 140

Gln Ala Leu Gly Glu Arg Ile Lys Pro Val Val Val Ile Asn Lys Val
145 150 155 160

Asp Arg Ala Leu Leu Glu Leu Gln Val Ser Lys Glu Asp Leu Tyr Gln
165 170 175

Thr Phe Ala Arg Thr Val Glu Ser Val Asn Val Ile Val Ser Thr Tyr
180 185 190

Ala Asp Glu Val Leu Gly Asp Val Gln Val Tyr Pro Ala Arg Gly Thr
195 200 205

Val Ala Phe Gly Ser Gly Leu His Gly Trp Ala Phe Thr Ile Arg Gln
210 215 220

Phe Ala Thr Arg Tyr Ala Lys Lys Phe Gly Val Asp Lys Ala Lys Met
225 230 235 240

Met Asp Arg Leu Trp Gly Asp Ser Phe Phe Asn Pro Lys Thr Lys Lys
245 250 255

Trp Thr Asn Lys Asp Thr Asp Ala Glu Gly Lys Pro Leu Glu Arg Ala
260 265 270

Phe Asn Met Phe Ile Leu Asp Pro Ile Phe Arg Leu Phe Thr Ala Ile
275 280 285

Met Asn Phe Lys Lys Asp Glu Ile Pro Val Leu Leu Glu Lys Leu Glu
290 295 300

Ile Val Leu Lys Gly Asp Glu Lys Asp Leu Glu Gly Lys Ala Leu Leu
305 310 315 320

Lys Val Val Met Arg Lys Phe Leu Pro Ala Ala Asp Ala Leu Leu Glu
325 330 335

Met Ile Val Leu His Leu Pro Ser Pro Val Thr Ala Gln Ala Tyr Arg
340 345 350

Ala Glu Gln Leu Tyr Glu Gly Pro Ala Asp Asp Ala Asn Cys Ile Ala
355 360 365

Ile Lys Asn Cys Asp Pro Lys Ala Asp Leu Met Leu Tyr Val Ser Lys
370 375 380

Met Val Pro Thr Ser Asp Lys Gly Arg Phe Tyr Ala Phe Gly Arg Val
385 390 395 400

Phe Ala Gly Thr Val Lys Ser Gly Gln Lys Val Arg Ile Gln Gly Pro
405 410 415

Asn Tyr Val Pro Gly Lys Lys Asp Asp Leu Phe Ile Lys Ala Ile Gln
420 425 430

Arg Val Val Leu Met Met Gly Arg Phe Val Glu Pro Ile Asp Asp Cys
435 440 445

Pro Ala Gly Asn Ile Ile Gly Leu Val Gly Ile Asp Gln Phe Leu Leu
450 455 460

Lys Thr Gly Thr Leu Thr Thr Ser Glu Thr Ala His Asn Met Lys Val
465 470 475 480

Met Lys Phe Ser Val Ser Pro Val Val Gln Val Ala Val Glu Val Lys
485 490 495

Asn Ala Asn Asp Leu Pro Lys Leu Val Glu Gly Leu Lys Arg Leu Ser
500 505 510

Lys Ser Asp Pro Cys Val Leu Thr Tyr Met Ser Glu Ser Gly Glu His
515 520 525

Ile Val Ala Gly Thr Gly Glu Leu His Leu Glu Ile Cys Leu Gln Asp
530 535 540

Leu Glu His Asp His Ala Gly Val Pro Leu Lys Ile Ser Pro Pro Val
545 550 555 560

Val Ala Tyr Arg Glu Thr Val Glu Ser Glu Ser Ser Gln Thr Ala Leu
565 570 575

Ser Lys Ser Pro Asn Lys His Asn Arg Ile Tyr Leu Lys Ala Glu Pro
580 585 590

Ile Asp Glu Glu Val Ser Leu Ala Ile Glu Asn Gly Ile Ile Asn Pro
595 600 605

Arg Asp Asp Phe Lys Ala Arg Ala Arg Ile Met Ala Asp Asp Tyr Gly
610 615 620

Trp Asp Val Thr Asp Ala Arg Lys Ile Trp Cys Phe Gly Pro Asp Gly
625 630 635 640

Asn Gly Pro Asn Leu Val Ile Asp Gln Thr Lys Ala Val Gln Tyr Leu
645 650 655

His Glu Ile Lys Asp Ser Val Val Ala Ala Phe Gln Trp Ala Thr Lys
660 665 670

Glu Gly Pro Ile Phe Gly Glu Glu Met Arg Ser Val Arg Val Asn Ile
675 680 685

Leu Asp Val Thr Leu His Ala Asp Ala Ile His Arg Gly Gly Gly Gln
690 695 700

Ile Ile Pro Thr Met Arg Arg Ala Thr Tyr Ala Gly Phe Leu Leu Ala
705 710 715 720

Asp Pro Lys Ile Gln Glu Pro Val Phe Leu Val Glu Ile Gln Cys Pro
725 730 735

Glu Gln Ala Val Gly Gly Ile Tyr Ser Val Leu Asn Lys Lys Arg Gly
740 745 750

Gln Val Val Ser Glu Glu Gln Arg Pro Gly Thr Pro Leu Phe Thr Val
755 760 765

Lys Ala Tyr Leu Pro Val Asn Glu Ser Phe Gly Phe Thr Gly Glu Leu
770 775 780

Arg Gln Ala Thr Gly Gly Gln Ala Phe Pro Gln Met Val Phe Asp His
785 790 795 800

Trp Ser Thr Leu Gly Ser Asp Pro Leu Asp Pro Thr Ser Lys Ala Gly
805 810 815

Glu Ile Val Leu Ala Ala Arg Lys Arg His Gly Met Lys Glu Glu Val
820 825 830

Pro Gly Trp Gln Glu Tyr Tyr Asp Lys Leu
835 840



<210> 12
<211> 2529
<212> DNA
<213> Saccharomyces cerevisiae
<400> 12

atggttgctt tcactgttga ccaaatgcgt tctttaatgg acaaagttac caatgtgcgt 60
aacatgtccg ttattgctca cgtcgatcat ggtaagtcca ctttgaccga ttccttggtc 120
caaagagccg gtattatttc cgctgctaag gctggtgaag ctcgtttcac cgataccaga 180
aaggatgaac aagaaagagg tatcactatc aagtctaccg ctatttctct atactctgaa 240
atgtctgacg aagatgtcaa ggaaatcaag caaaagaccg acggtaactc cttcttgatc 300
aacttgatcg actctccagg tcacgttgac ttctcctctg aagttactgc cgctttacgt 360
gtcactgacg gtgctttggt tgtcgtcgac accattgaag gtgtctgtgt ccaaaccgaa 420
actgttttga gacaagcttt gggtgagaga atcaagcctg ttgttgttat caacaaggtc 480
gacagagctt tgttggaatt gcaagtttct aaggaagatt tataccaaac ctttgccaga 540
actgttgaat ccgttaacgt catcgtttcc acctacgccg atgaagtttt gggtgatgtc 600
caagtttacc cagccagagg taccgttgcc ttcggttccg gtttgcacgg ttgggctttc 660
actatccgtc aattcgccac cagatatgct aagaaattcg gtgtcgacaa ggccaagatg 720
atggacagat tatggggtga ctctttcttc aacccaaaga ccaagaagtg gaccaacaag 780
gacactgatg ctgaaggtaa gccattggaa agagctttca acatgttcat cttggaccca 840
atcttcagat tattcactgc tatcatgaac ttcaagaaag atgaaattcc agttttgcta 900
gaaaagttgg aaattgtctt gaagggtgac gaaaaggact tggaaggtaa ggccttgttg 960
aaggttgtta tgagaaagtt cttgccagct gccgatgcct tattggaaat gattgtcttg 1020
cacttgccat ctccagtcac tgctcaagcc tacagagctg aacaattata cgaaggtcca 1080
gctgacgatg ccaactgtat tgctatcaag aactgtgatc caaaggctga tttgatgttg 1140
tacgtctcca agatggtgcc aacctctgat aagggtagat tctacgcctt cggtagagtt 1200
tttgccggta ctgttaagtc cggtcaaaag gtcagaatcc aaggtccaaa ctacgttcca 1260
ggtaagaagg acgatttgtt catcaaggcc attcaaagag ttgttttgat gatgggtaga 1320
tttgtcgaac caatcgatga ctgtccagcc ggtaacatta tcggtttagt cggtatcgat 1380
caattcttgt tgaagactgg tactttgacc accagtgaaa ctgctcacaa catgaaggtc 1440
atgaaattct ctgtctctcc agttgtgcaa gtcgctgtcg aagtcaagaa cgctaacgac 1500
ttaccaaaat tggtcgaagg tttgaagaga ttgtccaagt ctgatccatg tgtcttgacc 1560
tatatgtctg aatccggtga acatatcgtt gctggtaccg gtgaattgca tttggaaatt 1620
tgtttgcaag atttggaaca cgaccacgct ggtgttccat tgaagatctc cccaccagtt 1680
gtcgcttaca gagaaactgt tgaaagtgaa tcttctcaaa ctgctttgtc caagtctcca 1740
aacaagcata acagaatcta cttgaaggct gaaccaattg acgaagaagt ctctttggct 1800
attgaaaacg gtatcatcaa cccaagagat gatttcaagg ccagagctag aatcatggct 1860
gacgactacg gttgggatgt caccgatgcc agaaagatct ggtgtttcgg tccagacggt 1920
aacggtccaa acttggttat tgaccaaact aaggctgtcc aatacttgca cgaaatcaag 1980
gattccgttg ttgctgcttt ccaatgggct accaaggaag gtccaatttt cggtgaagaa 2040
atgagatctg tcagagttaa cattttggat gttactttac atgccgatgc tatccacaga 2100
ggtggtggtc aaatcatccc aaccatgaga agagctactt acgctggttt cttgttggct 2160
gatccaaaga tccaagaacc agttttcttg gtcgaaattc aatgtccaga acaagccgtc 2220
ggtggtatct actccgtctt aaacaagaag agaggtcaag tcgtttctga agaacaaaga 2280
ccaggtactc cattgtttac cgtcaaggcc tacttgccag ttaacgaatc tttcggtttc 2340
actggtgaat tgagacaagc tactggtggt caagctttcc cacaaatggt tttcgaccat 2400
tggtccactt taggttctga cccattggac ccaacctcta aggctggtga aattgttctt 2460
gctgctcgta agagacacgg tatgaaggaa gaagttccag gctggcaaga atattacgac 2520
aaattgtaa 2529

<210> 13
<211> 437
<212> PRT
<213> Saccharomyces cerevisiae
<400> 13

Met Ala Val Ser Lys Val Tyr Ala Arg Ser Val Tyr Asp Ser Arg Gly
1 5 10 15

Asn Pro Thr Val Glu Val Glu Leu Thr Thr Glu Lys Gly Val Phe Arg
20 25 30

Ser Ile Val Pro Ser Gly Ala Ser Thr Gly Val His Glu Ala Leu Glu
35 40 45

Met Arg Asp Glu Asp Lys Ser Lys Trp Met Gly Lys Gly Val Met Asn
50 55 60

Ala Val Asn Asn Val Asn Asn Val Ile Ala Ala Ala Phe Val Lys Ala
65 70 75 80

Asn Leu Asp Val Lys Asp Gln Lys Ala Val Asp Asp Phe Leu Leu Ser
85 90 95

Leu Asp Gly Thr Ala Asn Lys Ser Lys Leu Gly Ala Asn Ala Ile Leu
100 105 110

Gly Val Ser Met Ala Ala Ala Arg Ala Ala Ala Ala Glu Lys Asn Val
115 120 125

Pro Leu Tyr Gln His Leu Ala Asp Leu Ser Lys Ser Lys Thr Ser Pro
130 135 140

Tyr Val Leu Pro Val Pro Phe Leu Asn Val Leu Asn Gly Gly Ser His
145 150 155 160

Ala Gly Gly Ala Leu Ala Leu Gln Glu Phe Met Ile Ala Pro Thr Gly
165 170 175

Ala Lys Thr Phe Ala Glu Ala Met Arg Ile Gly Ser Glu Val Tyr His
180 185 190

Asn Leu Lys Ser Leu Thr Lys Lys Arg Tyr Gly Ala Ser Ala Gly Asn
195 200 205

Val Gly Asp Glu Gly Gly Val Ala Pro Asn Ile Gln Thr Ala Glu Glu
210 215 220

Ala Leu Asp Leu Ile Val Asp Ala Ile Lys Ala Ala Gly His Asp Gly
225 230 235 240

Lys Val Lys Ile Gly Leu Asp Cys Ala Ser Ser Glu Phe Phe Lys Asp
245 250 255

Gly Lys Tyr Asp Leu Asp Phe Lys Asn Pro Glu Ser Asp Lys Ser Lys
260 265 270

Trp Leu Thr Gly Val Glu Leu Ala Asp Met Tyr His Ser Leu Met Lys
275 280 285

Arg Tyr Pro Ile Val Ser Ile Glu Asp Pro Phe Ala Glu Asp Asp Trp
290 295 300

Glu Ala Trp Ser His Phe Phe Lys Thr Ala Gly Ile Gln Ile Val Ala
305 310 315 320

Asp Asp Leu Thr Val Thr Asn Pro Ala Arg Ile Ala Thr Ala Ile Glu
325 330 335

Lys Lys Ala Ala Asp Ala Leu Leu Leu Lys Val Asn Gln Ile Gly Thr
340 345 350

Leu Ser Glu Ser Ile Lys Ala Ala Gln Asp Ser Phe Ala Ala Asn Trp
355 360 365

Gly Val Met Val Ser His Arg Ser Gly Glu Thr Glu Asp Thr Phe Ile
370 375 380

Ala Asp Leu Val Val Gly Leu Arg Thr Gly Gln Ile Lys Thr Gly Ala
385 390 395 400

Pro Ala Arg Ser Glu Arg Leu Ala Lys Leu Asn Gln Leu Leu Arg Ile
405 410 415









Glu Glu Glu Leu Gly Asp Lys Ala Val Tyr Ala Gly Glu Asn Phe His 
420 425 430 



His Gly Asp Lys Leu 
435 

<210> 14
<211> 1314
<212> DNA
<213> Saccharomyces cerevisiae
<400> 14

atggctgtct ctaaagttta cgctagatcc gtctacgact cccgtggtaa cccaaccgtc 60
gaagtcgaat taaccaccga aaagggtgtt ttcagatcca ttgttccatc tggtgcctcc 120
accggtgtcc acgaagcttt ggaaatgaga gatgaagaca aatccaagtg gatgggtaag 180
ggtgttatga acgctgtcaa caacgtcaac aacgtcattg ctgctgcttt cgtcaaggcc 240
aacctagatg ttaaggacca aaaggccgtc gatgacttct tgttgtcttt ggatggtacc 300
gccaacaagt ccaagttggg tgctaacgct atcttgggtg tctccatggc cgctgctaga 360
gccgctgctg ctgaaaagaa cgtcccattg taccaacatt tggctgactt gtctaagtcc 420
aagacctctc catacgtttt gccagttcca ttcttgaacg ttttgaacgg tggttcccac 480
gctggtggtg ctttggcttt gcaagaattc atgattgctc caactggtgc taagaccttc 540
gctgaagcca tgagaattgg ttccgaagtt taccacaact tgaagtcttt gaccaagaag 600
agatacggtg cttctgccgg taacgtcggt gacgaaggtg gtgttgctcc aaacattcaa 660
accgctgaag aagctttgga cttgattgtt gacgctatca aggctgctgg tcacgacggt 720
aaggtcaaga tcggtttgga ctgtgcttcc tctgaattct tcaaggacgg taagtacgac 780
ttggacttca agaacccaga atctgacaaa tccaagtggt tgactggtgt cgaattagct 840
gacatgtacc actccttgat gaagagatac ccaattgtct ccatcgaaga tccatttgct 900
gaagatgact gggaagcttg gtctcacttc ttcaagaccg ctggtatcca aattgttgct 960
gatgacttga ctgtcaccaa cccagctaga attgctaccg ccatcgaaaa gaaggctgct 1020
gacgctttgt tgttgaaggt taaccaaatc ggtaccttgt ctgaatccat caaggctgct 1080
caagactctt tcgctgccaa ctggggtgtt atggtttccc acagatctgg tgaaactgaa 1140
gacactttca ttgctgactt ggttgtcggt ttgagaactg gtcaaatcaa gactggtgct 1200
ccagctagat ccgaaagatt ggctaagttg aaccaattgt tgagaatcga agaagaattg 1260
ggtgacaagg ctgtctacgc cggtgaaaac ttccaccacg gtgacaagtt gtaa 1314



<210> 15

<211> 312

<212> PRT

<213> Saccharomyces cerevisiae

<400> 15




























Met Asp Ser Gln Pro Val Asp Val Asp Asn Ile Ile Asp Arg Leu Leu
1 5 10 15

Glu Val Arg Gly Ser Lys Pro Gly Gln Gln Val Asp Leu Glu Glu Asn
20 25 30

Glu Ile Arg Tyr Leu Cys Ser Lys Ala Arg Ser Ile Phe Ile Lys Gln
35 40 45

Pro Ile Leu Leu Glu Leu Glu Ala Pro Ile Lys Ile Cys Gly Asp Ile
50 55 60

His Gly Gln Tyr Tyr Asp Leu Leu Arg Leu Phe Glu Tyr Gly Gly Phe
65 70 75 80

Pro Pro Glu Ser Asn Tyr Leu Phe Leu Gly Asp Tyr Val Asp Arg Gly
85 90 95

Lys Gln Ser Leu Glu Thr Ile Cys Leu Leu Leu Ala Tyr Lys Ile Lys
100 105 110

Tyr Pro Glu Asn Phe Phe Ile Leu Arg Gly Asn His Glu Cys Ala Ser
115 120 125

Ile Asn Arg Ile Tyr Gly Phe Tyr Asp Glu Cys Lys Arg Arg Tyr Asn
130 135 140

Ile Lys Leu Trp Lys Thr Phe Thr Asp Cys Phe Asn Cys Leu Pro Ile
145 150 155 160

Ala Ala Ile Ile Asp Glu Lys Ile Phe Cys Met His Gly Gly Leu Ser
165 170 175

Pro Asp Leu Asn Ser Met Glu Gln Ile Arg Arg Val Met Arg Pro Thr
180 185 190

Asp Ile Pro Asp Val Gly Leu Leu Cys Asp Leu Leu Trp Ser Asp Pro
195 200 205

Asp Lys Asp Ile Val Gly Trp Ser Glu Asn Asp Arg Gly Val Ser Phe
210 215 220

Thr Phe Gly Pro Asp Val Val Asn Arg Phe Leu Gln Lys Gln Asp Met
225 230 235 240

Glu Leu Ile Cys Arg Ala His Gln Val Val Glu Asp Gly Tyr Glu Phe
245 250 255

Phe Ser Lys Arg Gln Leu Val Thr Leu Phe Ser Ala Pro Asn Tyr Cys
260 265 270

Gly Glu Phe Asp Asn Ala Gly Ala Met Met Ser Val Asp Glu Ser Leu
275 280 285

Leu Cys Ser Phe Gln Ile Leu Lys Pro Ala Gln Lys Ser Leu Pro Arg
290 295 300

Gln Ala Gly Gly Arg Lys Lys Lys
305 310

<210> 16 
<211> 939 

<212> DNA 

<213> Saccharomyces cerevisiae
<400> 16 

atggactcac aaccagttga cgttgataat atcatcgata gattattgga agtaagagga 60 

tctaaacctg gtcaacaagt tgatctagaa gaaaatgaaa tcagatactt atgttcgaaa 120 

gccagatcta tattcataaa gcaacccatt ttactagagt tagaagcccc aattaaaata 180 

tgtggtgaca ttcatgggca atactatgat ttactacgtc tatttgagta cggtggattc 240 

ccgccagaat ctaattatct atttttgggt gattatgtcg accgtggtaa acaatcctta 300 

gagactattt gtctattact ggcttacaaa attaagtatc cagaaaactt tttcatttta 360 

agagggaacc atgaatgtgc ttccattaat agaatttacg ggttttatga tgaatgtaag 420 

agacgttata atatcaaact ttggaaaact ttcacggatt gtttcaattg tttaccaatt 480 

gctgcaatta ttgatgagaa aatcttctgt atgcatggtg gtctctcacc agatttgaat 540 

agtatggaac agatcagaag ggtgatgagg ccaacagata ttcccgacgt tggcttatta 600 

tgtgacttat tgtggtcaga tccagataaa gatatcgtag gttggagtga aaatgataga 660 

ggtgtttctt tcacttttgg tcctgatgta gtgaacagat ttttacagaa acaagatatg 720 

gagttgattt gcagggccca tcaagttgtg gaagatggtt atgaattctt tagtaaaaga 780 

caattggtga cacttttcag tgctccgaat tattgtggtg aatttgataa cgctggtgca 840 

atgatgagtg ttgatgaaag tttattatgt tcttttcaaa ttttaaagcc agcccaaaaa 900 

agtctaccaa ggcaagctgg gggtagaaag aaaaaataa 939 

<210> 17

<211> 247

<212> PRT

<213> Saccharomyces cerevisiae

<400> 17




























Met Pro Lys Leu Val Leu Val Arg His Gly Gln Ser Glu Trp Asn Glu
1 5 10 15

Lys Asn Leu Phe Thr Gly Trp Val Asp Val Lys Leu Ser Ala Lys Gly
20 25 30

Gln Gln Glu Ala Ala Arg Ala Gly Glu Leu Leu Lys Glu Lys Lys Val
35 40 45

Tyr Pro Asp Val Leu Tyr Thr Ser Lys Leu Ser Arg Ala Ile Gln Thr
50 55 60

Ala Asn Ile Ala Leu Glu Lys Ala Asp Arg Leu Trp Ile Pro Val Asn
65 70 75 80

Arg Ser Trp Arg Leu Asn Glu Arg His Tyr Gly Asp Leu Gln Gly Lys
85 90 95

Asp Lys Ala Glu Thr Leu Lys Lys Phe Gly Glu Glu Lys Phe Asn Thr
100 105 110

Tyr Arg Arg Ser Phe Asp Val Pro Pro Pro Pro Ile Asp Ala Ser Ser
115 120 125

Pro Phe Ser Gln Lys Gly Asp Glu Arg Tyr Lys Tyr Val Asp Pro Asn
130 135 140

Val Leu Pro Glu Thr Glu Ser Leu Ala Leu Val Ile Asp Arg Leu Leu
145 150 155 160

Pro Tyr Trp Gln Asp Val Ile Ala Lys Asp Leu Leu Ser Gly Lys Thr
165 170 175

Val Met Ile Ala Ala His Gly Asn Ser Leu Arg Gly Leu Val Lys His
180 185 190

Leu Glu Gly Ile Ser Asp Ala Asp Ile Ala Lys Leu Asn Ile Pro Thr
195 200 205

Gly Ile Pro Leu Val Phe Glu Leu Asp Glu Asn Leu Lys Pro Ser Lys
210 215 220

Pro Ser Tyr Tyr Leu Asp Pro Glu Ala Ala Ala Ala Gly Ala Ala Ala
225 230 235 240

Val Ala Asn Gln Gly Lys Lys
245



<210> 18
<211> 744

<212> DNA

<213> Saccharomyces cerevisiae

<400> 18

atgccaaagt tagttttagt tagacacggt caatccgaat ggaacgaaaa gaacttattc 60

accggttggg ttgatgttaa attgtctgcc aagggtcaac aagaagccgc tagagccggt 120

gaattgttga aggaaaagaa ggtctaccca gacgtcttgt acacttccaa gttgtccaga 180

gctatccaaa ctgctaacat tgctttggaa aaggctgaca gattatggat tccagtcaac 240

agatcctgga gattgaacga aagacattac ggtgacttac aaggtaagga caaggctgaa 300

actttgaaga agttcggtga agaaaaattc aacacctaca gaagatcctt cgatgttcca 360

cctcccccaa tcgacgcttc ttctccattc tctcaaaagg gtgatgaaag atacaagtac 420

gttgacccaa atgtcttgcc agaaactgaa tctttggctt tggtcattga cagattgttg 480

ccatactggc aagatgtcat tgccaaggac ttgttgagtg gtaagaccgt catgatcgcc 540

gctcacggta actccttgag aggtttggtt aagcacttgg aaggtatctc tgatgctgac 600

attgctaagt tgaacatccc aactggtatt ccattggtct tcgaattgga cgaaaacttg 660

aagccatcta agccatctta ctacttggac ccagaagctg ccgctgctgg tgccgctgct 720

gttgccaacc aaggtaagaa ataa 744



<210> 19

<211> 327

<212> PRT

<213> Saccharomyces cerevisiae
<400> 19

Met Ser Ser Ser Glu Asp Glu Asp Asp Lys Phe Leu Tyr Gly Ser Asp
1 5 10 15

Ser Glu Leu Ala Leu Pro Ser Ser Lys Arg Ser Arg Asp Asp Glu Ala
20 25 30

Asp Ala Gly Ala Ser Ser Asn Pro Asp Ile Val Lys Arg Gln Lys Phe
35 40 45

Asp Ser Pro Val Glu Glu Thr Pro Ala Thr Ala Arg Asp Asp Arg Ser
50 55 60

Asp Glu Asp Ile Tyr Ser Asp Ser Ser Asp Asp Asp Ser Asp Ser Asp
65 70 75 80

Leu Glu Val Ile Ile Ser Leu Gly Pro Asp Pro Thr Arg Leu Asp Ala
85 90 95

Lys Leu Leu Asp Ser Tyr Ser Thr Ala Ala Thr Ser Ser Ser Lys Asp
100 105 110

Val Ile Ser Val Ala Thr Asp Val Ser Asn Thr Ile Thr Lys Thr Ser
115 120 125

Asp Glu Arg Leu Ile Thr Glu Gly Glu Ala Asn Gln Gly Val Thr Ala
130 135 140

Thr Thr Val Lys Ala Thr Glu Ser Asp Gly Asn Val Pro Lys Ala Met
145 150 155 160

Thr Gly Ser Ile Asp Leu Asp Lys Glu Gly Ile Phe Asp Ser Val Gly
165 170 175

Ile Thr Thr Ile Asp Pro Glu Val Leu Lys Glu Lys Pro Trp Arg Gln
180 185 190

Pro Gly Ala Asn Leu Ser Asp Tyr Phe Asn Tyr Gly Phe Asn Glu Phe
195 200 205

Thr Trp Met Glu Tyr Leu His Arg Gln Glu Lys Leu Gln Gln Asp Tyr
210 215 220

Asn Pro Arg Arg Ile Leu Met Gly Leu Leu Ser Leu Gln Gln Gln Gly
225 230 235 240

Lys Leu Asn Ser Ala Asn Asp Thr Asp Ser Asn Leu Gly Asn Ile Ile
245 250 255

Asp Asn Asn Asn Asn Val Asn Asn Ala Asn Met Ser Asn Leu Asn Ser
260 265 270

Asn Met Gly Asn Ser Met Ser Gly Thr Pro Asn Pro Pro Ala Pro Pro
275 280 285

Met His Pro Ser Phe Pro Pro Leu Pro Met Phe Gly Ser Phe Pro Pro
290 295 300

Phe Pro Met Pro Gly Met Met Pro Pro Met Asn Gln Gln Pro Asn Gln
305 310 315 320

Asn Gln Asn Gln Asn Ser Lys
325



<210> 20
<211> 984

<212> DNA

<213> Saccharomyces cerevisiae
<400> 20

atgagctcca gtgaagacga agacgacaag ttcttgtatg gttccgactc cgaattagca 60

ctaccttcat ctaaacgatc aagagatgat gaagcagacg caggtgcgtc cagtaatcct 120

gatatagtta aaaggcaaaa attcgactct cccgtggaag aaactccagc tactgccaga 180

gatgatcgtt ctgatgaaga tatctactct gactcctcag atgacgatag tgattctgac 240

ctagaggtta tcataagtct gggtcctgac cctactaggt tagatgcaaa actactcgat 300

tcttattcta ccgcagcgac atcttcaagc aaagacgtaa ttagcgtagc tacagatgta 360

tccaatacca tcacaaagac atcagatgaa agactaataa cagaaggaga agcaaatcaa 420

ggtgtaacgg caacgaccgt aaaagctaca gagagcgatg gaaatgtacc gaaagcaatg 480

actggttcta tagacctgga taaagaggga atctttgata gtgttggcat aacgacaata 540

gatcctgaag tattaaagga gaaaccctgg aggcaaccgg gggccaactt aagtgattat 600

ttcaattacg gttttaacga atttacctgg atggagtatt tacatagaca ggaaaaacta 660

caacaagatt ataatcctag gaggatccta atgggcctat tatccctcca acagcaaggg 720

aagttgaatt ccgcgaatga tacagactca aacctcggta atataattga taacaacaac 780

aacgtaaaca atgcaaatat gtctaatctg aacagtaata tgggtaatag tatgtctgga 840

acaccaaacc ctcccgctcc accaatgcat ccaagcttcc cacccttacc tatgtttggt 900

agctttccac cattccccat gccaggtatg atgccaccca tgaaccaaca gcctaatcaa 960

aatcaaaatc aaaattcgaa atga 984



<210> 21
<211> 103

<212> PRT

<213> Saccharomyces cerevisiae
<400> 21

Met Ser Gly Arg Gly Lys Gly Gly Lys Gly Leu Gly Lys Gly Gly Ala
1 5 10 15

Lys Arg His Arg Lys Ile Leu Arg Asp Asn Ile Gln Gly Ile Thr Lys
20 25 30

Pro Ala Ile Arg Arg Leu Ala Arg Arg Gly Gly Val Lys Arg Ile Ser
35 40 45

Gly Leu Ile Tyr Glu Glu Val Arg Ala Val Leu Lys Ser Phe Leu Glu
50 55 60

Ser Val Ile Arg Asp Ser Val Thr Tyr Thr Glu His Ala Lys Arg Lys
65 70 75 80

Thr Val Thr Ser Leu Asp Val Val Tyr Ala Leu Lys Arg Gln Gly Arg
85 90 95

Thr Leu Tyr Gly Phe Gly Gly
100



<210> 22
<211> 312

<212> DNA

<213> Saccharomyces cerevisiae
<400> 22

atgtccggta gaggtaaagg tggtaaaggt ctaggaaaag gtggtgccaa gcgtcacaga 60

aagattctaa gagataacat tcaaggtatc actaagccag ctatcagaag attagctaga 120

agaggtggtg tcaagcgtat ttctggtttg atctacgaag aagtcagagc cgtcttgaaa 180

tccttcttgg aatccgtcat cagggactct gttacttaca ctgaacacgc caagagaaag 240

actgttactt ctttggatgt tgtttatgct ttgaagagac aaggtagaac cttatatggt 300

ttcggtggtt aa 312



<210> 23

<211> 132

<212> PRT

<213> Saccharomyces cerevisiae

<400> 23

Met Ser Gly Gly Lys Gly Gly Lys Ala Gly Ser Ala Ala Lys Ala Ser
1 5 10 15

Gln Ser Arg Ser Ala Lys Ala Gly Leu Thr Phe Pro Val Gly Arg Val
20 25 30

His Arg Leu Leu Arg Arg Gly Asn Tyr Ala Gln Arg Ile Gly Ser Gly
35 40 45

Ala Pro Val Tyr Leu Thr Ala Val Leu Glu Tyr Leu Ala Ala Glu Ile
50 55 60

Leu Glu Leu Ala Gly Asn Ala Ala Arg Asp Asn Lys Lys Thr Arg Ile
65 70 75 80

Ile Pro Arg His Leu Gln Leu Ala Ile Arg Asn Asp Asp Glu Leu Asn
85 90 95

Lys Leu Leu Gly Asn Val Thr Ile Ala Gln Gly Gly Val Leu Pro Asn
100 105 110

Ile His Gln Asn Leu Leu Pro Lys Lys Ser Ala Lys Ala Thr Lys Ala
115 120 125

Ser Gln Glu Leu
130



<210> 24

<211> 399

<212> DNA

<213> Saccharomyces cerevisiae
<400> 24

atgtccggtg gtaaaggtgg taaagctggt tcagctgcta aagcttctca atctagatct 60

gctaaggctg gtttgacatt cccagtcggt agagtgcaca gattgctaag aagaggtaac 120

tacgcccaaa gaattggttc tggtgctcca gtctacttga ctgctgtctt ggaatatttg 180

gccgctgaaa ttttagaatt agctggtaat gctgctaggg ataacaagaa gaccagaatt 240

attccaagac atttgcaatt ggctatcaga aatgatgacg aattgaacaa gctattgggt 300

aacgttacca ttgcccaagg tggtgttttg ccaaacatcc atcaaaactt gttgccaaag 360

aagtctgcca aggctaccaa ggcttctcaa gaattataa 399

<210> 25

<211> 705

<212> PRT

<213> Saccharomyces cerevisiae
<400> 25

Met Ala Gly Glu Thr Phe Glu Phe Gln Ala Glu Ile Thr Gln Leu Met
1 5 10 15

Ser Leu Ile Ile Asn Thr Val Tyr Ser Asn Lys Glu Ile Phe Leu Arg
20 25 30

Glu Leu Ile Ser Asn Ala Ser Asp Ala Leu Asp Lys Ile Arg Tyr Gln
35 40 45

Ala Leu Ser Asp Pro Lys Gln Leu Glu Thr Glu Pro Asp Leu Phe Ile
50 55 60

Arg Ile Thr Pro Lys Pro Glu Glu Lys Val Leu Glu Ile Arg Asp Ser
65 70 75 80

Gly Ile Gly Met Thr Lys Ala Glu Leu Ile Asn Asn Leu Gly Thr Ile
85 90 95



Ala Lys Ser Gly Thr Lys Ala Phe Met Glu Ala Leu Ser Ala Gly Ala
100 105 110

Asp Val Ser Met Ile Gly Gln Phe Gly Val Gly Phe Tyr Ser Leu Phe
115 120 125

Leu Val Ala Asp Arg Val Gln Val Ile Ser Lys Asn Asn Glu Asp Glu
130 135 140

Gln Tyr Ile Trp Glu Ser Asn Ala Gly Gly Ser Phe Thr Val Thr Leu
145 150 155 160

Asp Glu Val Asn Glu Arg Ile Gly Arg Gly Thr Val Leu Arg Leu Phe
165 170 175

Leu Lys Asp Asp Gln Leu Glu Tyr Leu Glu Glu Lys Arg Ile Lys Glu
180 185 190

Val Ile Lys Arg His Ser Glu Phe Val Ala Tyr Pro Ile Gln Leu Leu
195 200 205

Val Thr Lys Glu Val Glu Lys Glu Val Pro Ile Pro Glu Glu Glu Lys
210 215 220

Lys Asp Glu Glu Lys Lys Asp Glu Asp Asp Lys Lys Pro Lys Leu Glu
225 230 235 240

Glu Val Asp Glu Glu Glu Glu Glu Lys Lys Pro Lys Thr Lys Lys Val
245 250 255

Lys Glu Glu Val Gln Glu Leu Glu Glu Leu Asn Lys Thr Lys Pro Leu
260 265 270

Trp Thr Arg Asn Pro Ser Asp Ile Thr Gln Glu Glu Tyr Asn Ala Phe
275 280 285

Tyr Lys Ser Ile Ser Asn Asp Trp Glu Asp Pro Leu Tyr Val Lys His
290 295 300

Phe Ser Val Glu Gly Gln Leu Glu Phe Arg Ala Ile Leu Phe Ile Pro
305 310 315 320

Lys Arg Ala Pro Phe Asp Leu Phe Glu Ser Lys Lys Lys Lys Asn Asn
325 330 335







Ile Lys Leu Tyr Val Arg Arg Val Phe Ile Thr Asp Glu Ala Glu Asp
340 345 350

Leu Ile Pro Glu Trp Leu Ser Phe Val Lys Gly Val Val Asp Ser Glu
355 360 365

Asp Leu Pro Leu Asn Leu Ser Arg Glu Met Leu Gln Gln Asn Lys Ile
370 375 380

Met Lys Val Ile Arg Lys Asn Ile Val Lys Lys Leu Ile Glu Ala Phe
385 390 395 400

Asn Glu Ile Ala Glu Asp Ser Glu Gln Phe Asp Lys Phe Tyr Ser Ala
405 410 415

Phe Ala Lys Asn Ile Lys Leu Gly Val His Glu Asp Thr Gln Asn Arg
420 425 430

Ala Ala Leu Ala Lys Leu Leu Arg Tyr Asn Ser Thr Lys Ser Val Asp
435 440 445

Glu Leu Thr Ser Leu Thr Asp Tyr Val Thr Arg Met Pro Glu His Gln
450 455 460

Lys Asn Ile Tyr Tyr Ile Thr Gly Glu Ser Leu Lys Ala Val Glu Lys
465 470 475 480

Ser Pro Phe Leu Asp Ala Leu Lys Ala Lys Asn Phe Glu Val Leu Phe
485 490 495

Leu Thr Asp Pro Ile Asp Glu Tyr Ala Phe Thr Gln Leu Lys Glu Phe
500 505 510

Glu Gly Lys Thr Leu Val Asp Ile Thr Lys Asp Phe Glu Leu Glu Glu
515 520 525

Thr Asp Glu Glu Lys Ala Glu Arg Glu Lys Glu Ile Lys Glu Tyr Glu
530 535 540

Pro Leu Thr Lys Ala Leu Lys Asp Ile Leu Gly Asp Gln Val Glu Lys
545 550 555 560

Val Val Val Ser Tyr Lys Leu Leu Asp Ala Pro Ala Ala Ile Arg Thr
565 570 575

Gly Gln Phe Gly Trp Ser Ala Asn Met Glu Arg Ile Met Lys Ala Gln
580 585 590

Ala Leu Arg Asp Ser Ser Met Ser Ser Tyr Met Ser Ser Lys Lys Thr
595 600 605

Phe Glu Ile Ser Pro Lys Ser Pro Ile Ile Lys Glu Leu Lys Lys Arg
610 615 620

Val Asp Glu Gly Gly Ala Gln Asp Lys Thr Val Lys Asp Leu Thr Asn
625 630 635 640

Leu Leu Phe Glu Thr Ala Leu Leu Thr Ser Gly Phe Ser Leu Glu Glu
645 650 655

Pro Thr Ser Phe Ala Ser Arg Ile Asn Arg Leu Ile Ser Leu Gly Leu
660 665 670

Asn Ile Asp Glu Asp Glu Glu Thr Glu Thr Ala Pro Glu Ala Ser Thr
675 680 685

Glu Ala Pro Val Glu Glu Val Pro Ala Asp Thr Glu Met Glu Glu Val
690 695 700

Asp
705



<210> 26
<211> 2000
<212> DNA

<213> Saccharomyces cerevisiae
<400> 26

atggctggtg aaacttttga atttcaagct gaaatcactc agttgatgag tttgatcatc 60

aacactgtct attctaacaa ggaaattttc ttgagagaac tgatctctaa cgcctccgat 120

gctttagaca aaattagata ccaagctttg tctgatccaa agcaattgga aaccgaacca 180

gatttgttca ttagaatcac cccaaaacca gaagaaaaag ttttggaaat cagagattct 240

ggtattggta tgaccaaggc tgaattgatt aacaatttgg gtaccattgc taagtctggt 300

actaaagctt tcatggaagc tctatctgct ggtgccgatg tatccatgat tggtcaattc 360

ggtgttggtt tttactcttt attcttagtc gccgacagag ttcaagttat ttccaagaac 420

aatgaggacg aacaatatat ttgggaatct aatgccggtg gttctttcac cgttactttg 480

gacgaagtta acgaaagaat tggtagaggt accgtcttga gattattctt gaaagatgac 540

caattggagt acttggaaga aaagagaatt aaagaagtca tcaagagaca ttctgaattc 600

gttgcttacc ctatccaact tctagtcacc aaggaagtcg aaaaggaagt tccaattcca 660

gaagaagaaa agaaagacga ggaaaagaag gatgaagatg acaagaaacc aaaattggaa 720

gaagtcgatg aagaagaaga agaaaagaag ccaaaaacca aaaaagttaa agaagaggtt 780

caagaattag aagagttgaa caagactaag ccattatgga ctagaaaccc atctgatatc 840

actcaagagg aatacaatgc tttctataag tctatttcta acgactggga agacccattg 900

tacgttaagc atttctctgt tgaaggtcaa ttggaattta gagctatctt gttcattcca 960

aagagagcac cattcgactt atttgagagt aagaagaaga agaacaatat caagttgtac 1020

gttcgtcgtg tcttcatcac tgatgaagct gaagacttga ttccagagtg gttatctttc 1080

gtcaagggtg ttgttgactc tgaagattta ccattgaatt tgtccagaga aatgttacaa 1140

caaaataaga ttatgaaggt tattagaaag aatattgtca agaaattgat tgaagccttc 1200

aacgaaatcg ctgaagactc cgagcaattt gacaaatttt actctgcctt cgctaagaac 1260

attaagctgg gtgtacatga ggacactcaa aacagagctg ctttagctaa gttgctacgt 1320

tacaattcta ctaaatctgt cgatgaattg acttccttga ctgattacgt tactagaatg 1380

ccagaacacc aaaagaacat ctattacatc accggtgaat ctctaaaggc agtcgaaaag 1440

tctccattct tggacgcctt gaaggctaag aactttgaag ttttgttctt gaccgaccca 1500

attgatgaat acgctttcac tcaattgaag gaattcgagg gtaaaacttt ggttgacatt 1560

actaaagatt tcgaattgga agaaacagac gaagaaaaag ctgaaagaga gaaggagatc 1620

aaagaatacg aaccattgac caaggccttg aaggatatct tgggtgacca agtggagaag 1680

gttgttgttt cttacaaatt gctagatgct ccagctgcca tcagaactgg tcaattcggc 1740

tggtctgcta acatggaaag aatcatgaag gctcaagcct tgagagactc ttccatgtcc 1800

tcctacatgt cttccaagaa gactttcgaa atttctccaa aatctccaat tattaaggaa 1860

ttgaaaaaga gagttgatga gggtggtgca caagataaga ccgtcaaaga tttgactaac 1920

ttattattcg agaccgcttt gttgacttct ggtttcagtt tggaagaacc aacttctttt 1980

gcatcaagaa taaatagatt 2000



<210> 27 

<211> 523 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 27 

Met Ala Ala Ile Arg Asp Tyr Lys Thr Ala Leu Asp Phe Thr Lys Ser
1 5 10 15

Leu Pro Arg Pro Asp Gly Leu Ser Val Gln Glu Leu Met Asp Ser Lys
20 25 30

Ile Arg Gly Gly Leu Thr Tyr Asn Asp Phe Leu Ile Leu Pro Gly Leu
35 40 45

Val Asp Phe Ala Ser Ser Glu Val Ser Leu Gln Thr Lys Leu Thr Arg
50 55 60

Asn Ile Thr Leu Asn Ile Pro Leu Val Ser Ser Pro Met Asp Thr Val
65 70 75 80

Thr Glu Ser Glu Met Ala Thr Phe Met Ala Leu Leu Gly Gly Ile Gly
85 90 95

Phe Ile His His Asn Cys Thr Pro Glu Asp Gln Ala Asp Met Val Arg
100 105 110

Arg Val Lys Asn Tyr Glu Asn Gly Phe Ile Asn Asn Pro Ile Val Ile
115 120 125

Ser Pro Thr Thr Thr Val Gly Glu Ala Lys Ser Met Lys Glu Lys Tyr
130 135 140

Gly Phe Ala Gly Phe Pro Val Thr Thr Asp Gly Lys Arg Asn Ala Lys
145 150 155 160

Leu Val Gly Val Ile Thr Ser Arg Asp Ile Gln Phe Val Glu Asp Asn
165 170 175

Ser Leu Leu Val Gln Asp Val Met Thr Lys Asn Pro Val Thr Gly Ala
180 185 190

Gln Gly Ile Thr Leu Ser Glu Gly Asn Glu Ile Leu Lys Lys Ile Lys
195 200 205

Lys Gly Arg Leu Leu Val Val Asp Glu Lys Gly Asn Leu Val Ser Met
210 215 220

Leu Ser Arg Thr Asp Leu Met Lys Asn Gln Asn Tyr Pro Leu Ala Ser
225 230 235 240

Lys Ser Ala Asn Thr Lys Gln Leu Leu Cys Gly Ala Ser Ile Gly Thr
245 250 255

Met Asp Ala Asp Lys Glu Arg Leu Arg Leu Leu Val Lys Ala Gly Leu
260 265 270

Asp Val Val Ile Leu Asp Ser Ser Gln Gly Asn Ser Ile Phe Glu Leu
275 280 285













Asn Met Leu Lys Trp Val Lys Glu Ser Phe Pro Gly Leu Glu Val Ile
290 295 300

Ala Gly Asn Val Val Thr Arg Glu Gln Ala Ala Asn Leu Ile Ala Ala
305 310 315 320

Gly Ala Asp Gly Leu Arg Ile Gly Met Gly Thr Gly Ser Ile Cys Ile
325 330 335

Thr Gln Glu Val Met Ala Cys Gly Arg Pro Gln Gly Thr Ala Val Tyr
340 345 350

Asn Val Cys Glu Phe Ala Asn Gln Phe Gly Val Pro Cys Met Ala Asp
355 360 365

Gly Gly Val Gln Asn Ile Gly His Ile Thr Lys Ala Leu Ala Leu Gly
370 375 380

Ser Ser Thr Val Met Met Gly Gly Met Leu Ala Gly Thr Thr Glu Ser
385 390 395 400

Pro Gly Glu Tyr Phe Tyr Gln Asp Gly Lys Arg Leu Lys Ala Tyr Arg
405 410 415

Gly Met Gly Ser Ile Asp Ala Met Gln Lys Thr Gly Thr Lys Gly Asn
420 425 430

Ala Ser Thr Ser Arg Tyr Phe Ser Glu Ser Asp Ser Val Leu Val Ala
435 440 445

Gln Gly Val Ser Gly Ala Val Val Asp Lys Gly Ser Ile Lys Lys Phe
450 455 460

Ile Pro Tyr Leu Tyr Asn Gly Leu Gln His Ser Cys Gln Asp Ile Gly
465 470 475 480

Cys Arg Ser Leu Thr Leu Leu Lys Asn Asn Val Gln Arg Gly Lys Val
485 490 495

Arg Phe Glu Phe Arg Thr Ala Ser Ala Gln Leu Glu Gly Gly Val His
500 505 510

Asn Leu His Ser Tyr Glu Lys Arg Leu His Asn
515 520



<210> 28
<211> 1572

<212> DNA
<213> Saccharomyces cerevisiae

<400> 28

atggccgcca ttagagacta caagaccgca ctagacttta ccaagagcct accaagaccg 60

gatggtttgt cagtgcagga attgatggac tccaagatca gaggtgggtt gacttataac 120

gattttttaa tcttaccagg tttagtcgat tttgcgtcct ctgaagttag cctacagacc 180

aagctaacca ggaatattac tttaaatatt ccattagttt cctctccaat ggacacggtg 240

acagagtcag aaatggccac ttttatggct ctgttgggtg gtatcggttt cattcaccat 300

aactgtaccc cagaggacca agctgacatg gtcagaagag tcaagaacta tgaaaatggg 360

tttattaaca accctatagt gatttctcca actacgaccg ttggtgaagc taagagcatg 420

aaggaaaagt atggatttgc aggcttccct gtcacgacag atggaaagag aaatgcaaag 480

ttggtgggtg tcatcacctc tcgtgatata caattcgttg aggacaactc tttactcgtt 540

caggatgtca tgaccaaaaa ccctgttacc ggcgcacaag gtatcacatt atcagaaggt 600

aacgaaattc taaagaaaat caaaaagggt aggctattgg ttgttgatga aaagggtaac 660

ttagtttcta tgctttcccg aactgattta atgaaaaatc agaactaccc attagcgtcc 720

aaatctgcca acaccaagca actgttatgt ggtgcttcta ttgggactat ggacgctgat 780

aaagaaagac taagattatt ggtaaaagct gggttggatg tcgtcatatt ggattcatcc 840

caaggaaact ctatcttcga attgaacatg ctcaagtggg tcaaagagag tttcccaggt 900

ctggaagtca tcgctggtaa cgttgtcacc agggaacaag ctgccaattt gattgctgcc 960

ggtgcggacg gtttgagaat tggtatggga actggctcta tttgtattac ccaagaagtt 1020

atggcttgtg gtaggccaca aggtacagcc gtctacaatg tgtgtgaatt tgctaaccaa 1080

ttcggtgttc catgtatggc tgatggtggt gttcaaaaca ttggtcatat taccaaagct 1140

ttggctcttg gttcttctac tgttatgatg ggtggtatgt tggccggtac taccgaatca 1200

ccaggtgaat atttctatca agatggtaaa agattgaagg cgtatcgtgg tatgggctcc 1260

attgacgcca tgcaaaagac tggtaccaaa ggtaatgcat ctacctcccg ttacttttcc 1320

gaatcagaca gtgttttggt cgcacaaggt gtctccggtg ctgtcgttga caaaggatcc 1380

attaagaaat ttattccata tttgtacaat ggtttacaac attcctgtca agacatcggc 1440

tgtaggtcgc taactttatt aaagaataat gttcaaaggg gtaaagttag atttgaattc 1500

agaaccgctt ctgctcaact agaaggtggc gttcataatt tacattctta cgaaaagcgt 1560

ttacataact ga 1572



<210> 29 

<211> 524 

<212> PRT 

<213> Saccharomyces cerevisiae 


















<400> 29 




























Met Ser Ala Ala Pro Leu Asp Tyr Lys Lys Ala Leu Glu His Leu Lys
1 5 10 15

Thr Tyr Ser Ser Lys Asp Gly Leu Ser Val Gln Glu Leu Met Asp Ser
20 25 30

Thr Thr Arg Gly Gly Leu Thr Tyr Asn Asp Phe Leu Val Leu Pro Gly
35 40 45

Leu Val Asn Phe Pro Ser Ser Ala Val Ser Leu Gln Thr Lys Leu Thr
50 55 60

Lys Lys Ile Thr Leu Asn Thr Pro Phe Val Ser Ser Pro Met Asp Thr
65 70 75 80

Val Thr Glu Ala Asp Met Ala Ile Tyr Met Ala Leu Leu Gly Gly Ile
85 90 95

Gly Phe Ile His His Asn Cys Thr Pro Lys Glu Gln Ala Ser Met Val
100 105 110

Lys Lys Val Lys Met Phe Glu Asn Gly Phe Ile Asn Ser Pro Ile Val
115 120 125

Ile Ser Pro Thr Thr Thr Val Gly Glu Val Lys Val Met Lys Arg Lys
130 135 140

Phe Gly Phe Ser Gly Phe Pro Val Thr Glu Asp Gly Lys Cys Pro Gly
145 150 155 160

Lys Leu Val Gly Leu Val Thr Ser Arg Asp Ile Gln Phe Leu Glu Asp
165 170 175

Asp Ser Leu Val Val Ser Glu Val Met Thr Lys Asn Pro Val Thr Gly
180 185 190

Ile Lys Gly Ile Thr Leu Lys Glu Gly Asn Glu Ile Leu Lys Gln Thr
195 200 205

Lys Lys Gly Lys Leu Leu Ile Val Asp Asp Asn Gly Asn Leu Val Ser
210 215 220

Met Leu Ser Arg Ala Asp Leu Met Lys Asn Gln Asn Tyr Pro Leu Ala
225 230 235 240

Ser Lys Ser Ala Thr Thr Lys Gln Leu Leu Cys Gly Ala Ala Ile Gly
245 250 255

Thr Ile Glu Ala Asp Lys Glu Arg Leu Arg Leu Leu Val Glu Ala Gly
260 265 270

Leu Asp Val Val Ile Leu Asp Ser Ser Gln Gly Asn Ser Val Phe Gln
275 280 285

Leu Asn Met Ile Lys Trp Ile Lys Glu Thr Phe Pro Asp Leu Glu Ile
290 295 300

Ile Ala Gly Asn Val Ala Thr Arg Glu Gln Ala Ala Asn Leu Ile Ala
305 310 315 320

Ala Gly Ala Asp Gly Leu Arg Ile Gly Met Gly Ser Gly Ser Ile Cys
325 330 335

Ile Thr Gln Glu Val Met Ala Cys Gly Arg Pro Gln Gly Thr Ala Val
340 345 350

Tyr Asn Val Cys Gln Phe Ala Asn Gln Phe Gly Val Pro Cys Met Ala
355 360 365

Asp Gly Gly Val Gln Asn Ile Gly His Ile Thr Lys Ala Leu Ala Leu
370 375 380

Gly Ser Ser Thr Val Met Met Gly Gly Met Leu Ala Gly Thr Thr Glu
385 390 395 400

Ser Pro Gly Glu Tyr Phe Tyr Lys Asp Gly Lys Arg Leu Lys Ala Tyr
405 410 415

Arg Gly Met Gly Ser Ile Asp Ala Met Gln Lys Thr Gly Asn Lys Gly
420 425 430

Asn Ala Ser Thr Ser Arg Tyr Phe Ser Glu Ser Asp Ser Val Leu Val
435 440 445

Ala Gln Gly Val Ser Gly Ala Val Val Asp Lys Gly Ser Ile Lys Lys
450 455 460

Phe Ile Pro Tyr Leu Tyr Asn Gly Leu Gln His Ser Cys Gln Asp Ile
465 470 475 480

Gly Cys Glu Ser Leu Thr Ser Leu Lys Glu Asn Val Gln Asn Gly Glu
485 490 495

Val Arg Phe Glu Phe Arg Thr Ala Ser Ala Gln Leu Glu Gly Gly Val
500 505 510

His Asn Leu His Ser Tyr Glu Lys Arg Leu Tyr Asn
515 520

<210> 30 

<211> 1575 

<212> DNA 

<213 > Saccharomyces cerevisiae 
<400> 30 



atgagtgctg ctccattgga ttacaaaaag gctttagaac atttgaagac atacagttcc 60

aaggacggtc tttctgtcca ggaattgatg gactccacaa ctagaggtgg gttgacctac 120

aatgattttt tggtcttacc aggcttggtc aatttcccat cttctgctgt cagtttgcaa 180

accaaattga ccaagaaaat cactttgaac actccttttg tctcttctcc tatggacaca 240

gtcactgaag ctgatatggc tatttatatg gctttattgg gtggtattgg tttcatccat 300

cacaactgta ctccaaagga acaggcttcc atggtcaaga aagttaaaat gtttgaaaac 360

ggtttcatca attctccaat agtaatttct ccaaccacca ctgttggtga agttaaggtt 420

atgaagagaa agtttggttt ctctggcttc ccagttactg aagacggtaa gtgtccaggt 480

aagttagttg ggttagtcac ctctcgtgat atacaattct tagaagatga ttctttggtt 540

gtttctgaag tcatgactaa aaatccagtt actggcatca agggtattac tttgaaagaa 600

ggtaatgaaa tcctaaaaca aaccaagaaa ggtaaattgc ttatcgttga tgataacggt 660

aacctcgttt ccatgttgtc aagagcggat ttgatgaaga atcaaaacta cccgttagct 720

tctaaatccg ccaccaccaa gcaattgcta tgtggtgctg caattggtac tatcgaagct 780

gataaggaaa gattaagact attagtcgaa gcaggtttgg atgttgttat cttagattcc 840

tctcaaggta actctgtttt ccaattgaac atgatcaaat ggattaaaga aactttccca 900

gatttggaaa tcattgctgg taacgttgcc accagagaac aagctgctaa cttgattgct 960

gccggtgccg atggtttaag aattggtatg ggttctgggt ctatttgtat cactcaagaa 1020

gttatggctt gtggtagacc acaaggtaca gctgtctaca acgtttgtca atttgctaac 1080

caatttggcg ttccatgtat ggctgatggt ggtgtccaaa acattggcca catcaccaag 1140

gctttggccc ttggttcctc caccgtcatg atgggtggta tgttagctgg tactaccgag 1200

tcaccaggtg aatacttcta caaggatggt aagagattga aggcttatcg tggtatgggg 1260

tccattgacg ccatgcaaaa gaccggtaac aagggtaatg cctctacttc tcgttatttc 1320

tctgagtcag acagtgtctt ggttgcacaa ggtgtttctg gggctgtcgt agacaaaggt 1380

tccatcaaga agtttattcc atatttgtac aacggtttac aacattcttg tcaagacatt 1440

ggttgcgaat ctctaacttc attgaaagag aatgttcaaa atggtgaagt tagatttgaa 1500

ttcagaaccg cttctgctca attagaaggt ggtgtccata atttgcactc ctatgaaaaa 1560

cgtctataca attga 1575

<210> 31
<211> 767
<212> PRT
<213> Saccharomyces cerevisiae
<400> 31

Met Val Gln Ser Ala Val Leu Gly Phe Pro Arg Ile Gly Pro Asn Arg
1 5 10 15

Glu Leu Lys Lys Ala Thr Glu Gly Tyr Trp Asn Gly Lys Ile Thr Val
20 25 30

Asp Glu Leu Phe Lys Val Gly Lys Asp Leu Arg Thr Gln Asn Trp Lys
35 40 45

Leu Gln Lys Glu Ala Gly Val Asp Ile Ile Pro Ser Asn Asp Phe Ser
50 55 60

Phe Tyr Asp Gln Val Leu Asp Leu Ser Leu Leu Phe Asn Val Ile Pro
65 70 75 80

Asp Arg Tyr Thr Lys Tyr Asp Leu Ser Pro Ile Asp Thr Leu Phe Ala
85 90 95

Met Gly Arg Gly Leu Gln Arg Lys Ala Thr Glu Thr Glu Lys Ala Val
100 105 110

Asp Val Thr Ala Leu Glu Met Val Lys Trp Phe Asp Ser Asn Tyr His
115 120 125

Tyr Val Arg Pro Thr Phe Ser Lys Thr Thr Gln Phe Lys Leu Asn Gly
130 135 140

Gln Lys Pro Val Asp Glu Phe Leu Glu Ala Lys Glu Leu Gly Ile His
145 150 155 160

Thr Arg Pro Val Leu Leu Gly Pro Val Ser Tyr Leu Phe Leu Gly Lys
165 170 175

Ala Asp Lys Asp Ser Leu Asp Leu Glu Pro Leu Ser Leu Leu Glu Gln
180 185 190

Leu Leu Pro Leu Tyr Thr Glu Ile Leu Ser Lys Leu Ala Ser Ala Gly
195 200 205


















Ala Thr Glu Val Gln Ile Asp Glu Pro Val Leu Val Leu Asp Leu Pro
210 215 220

Ala Asn Ala Gln Ala Ala Ile Lys Lys Ala Tyr Thr Tyr Phe Gly Glu
225 230 235 240

Gln Ser Asn Leu Pro Lys Ile Thr Leu Ala Thr Tyr Phe Gly Thr Val
245 250 255

Val Pro Asn Leu Asp Ala Ile Lys Gly Leu Pro Val Ala Ala Leu His
260 265 270

Val Asp Phe Val Arg Ala Pro Glu Gln Phe Asp Glu Val Val Ala Ala
275 280 285

Ile Gly Asn Lys Gln Thr Leu Ser Val Gly Ile Val Asp Gly Arg Asn
290 295 300

Ile Trp Lys Asn Asp Phe Lys Lys Ser Ser Ala Ile Val Asn Lys Ala
305 310 315 320

Ile Glu Lys Leu Gly Ala Asp Arg Val Val Val Ala Thr Ser Ser Ser
325 330 335

Leu Leu His Thr Pro Val Asp Leu Asn Asn Glu Thr Lys Leu Asp Ala
340 345 350

Glu Ile Lys Gly Phe Phe Ser Phe Ala Thr Gln Lys Leu Asp Glu Val
355 360 365

Val Val Ile Thr Lys Asn Val Ser Gly Gln Asp Val Ala Ala Ala Leu
370 375 380

Glu Ala Asn Ala Lys Ser Val Glu Ser Arg Gly Lys Ser Lys Phe Ile
385 390 395 400

His Asp Ala Ala Val Lys Ala Arg Val Ala Ser Ile Asp Glu Lys Met
405 410 415

Ser Thr Arg Ala Ala Pro Phe Glu Gln Arg Leu Pro Glu Gln Gln Lys
420 425 430

Val Phe Asn Leu Pro Leu Phe Pro Thr Thr Thr Ile Gly Ser Phe Pro
435 440 445

Gln Thr Lys Asp Ile Arg Ile Asn Arg Asn Lys Phe Asn Lys Gly Thr
450 455 460



Ile Ser Ala Glu Glu Tyr Glu Lys Phe Ile Asn Ser Glu Ile Glu Lys
465 470 475 480

Val Ile Arg Phe Gln Glu Glu Ile Gly Leu Asp Val Leu Val His Gly
485 490 495

Glu Pro Glu Arg Asn Asp Met Val Gln Tyr Phe Gly Glu Gln Ile Asn
500 505 510

Gly Tyr Ala Phe Thr Val Asn Gly Trp Val Gln Ser Tyr Gly Ser Arg
515 520 525

Tyr Val Arg Pro Pro Ile Ile Val Gly Asp Leu Ser Arg Pro Lys Ala
530 535 540

Met Ser Val Lys Glu Ser Val Tyr Ala Gln Ser Ile Thr Ser Lys Pro
545 550 555 560

Val Lys Gly Met Leu Thr Gly Pro Ile Thr Cys Leu Arg Trp Ser Phe
565 570 575

Pro Arg Asp Asp Val Asp Gln Lys Thr Gln Ala Met Gln Leu Ala Leu
580 585 590

Ala Leu Arg Asp Glu Val Asn Asp Leu Glu Ala Ala Gly Ile Lys Val
595 600 605

Ile Gln Val Asp Glu Pro Ala Leu Arg Glu Gly Leu Pro Leu Arg Glu
610 615 620

Gly Thr Glu Arg Ser Ala Tyr Tyr Thr Trp Ala Ala Glu Ala Phe Arg
625 630 635 640

Val Ala Thr Ser Gly Val Ala Asn Lys Thr Gln Ile His Ser His Phe
645 650 655

Cys Tyr Ser Asp Leu Asp Pro Asn His Ile Lys Ala Leu Asp Ala Asp
660 665 670

Val Val Ser Ile Glu Phe Ser Lys Lys Asp Asp Ala Asn Tyr Ile Ala
675 680 685

Glu Phe Lys Asn Tyr Pro Asn His Ile Gly Leu Gly Leu Phe Asp Ile
690 695 700

His Ser Pro Arg Ile Pro Ser Lys Asp Glu Phe Ile Ala Lys Ile Ser
705 710 715 720

Thr Ile Leu Lys Ser Tyr Pro Ala Glu Lys Phe Trp Val Asn Pro Asp
725 730 735

Cys Gly Leu Lys Thr Arg Gly Trp Glu Glu Thr Arg Leu Ser Leu Thr
740 745 750

His Met Val Glu Ala Ala Lys Tyr Phe Arg Glu Gln Tyr Lys Asn
755 760 765

<210> 32
<211> 2304
<212> DNA
<213> Saccharomyces cerevisiae
<400> 32

atggttcaat ctgctgtctt agggttccca agaatcggtc caaacagaga attaaagaag 60 

gccactgaag gttactggaa cggtaaaatc actgtcgatg aattattcaa ggtcggtaag 120 

gatttgagaa ctcaaaactg gaagttgcaa aaggaggctg gtgttgatat catcccatcc 180 

aatgacttct ccttttacga ccaagttttg gatttgtctt tgttgttcaa tgtcattcca 240 

gaccgttaca ctaagtacga tctatctcca atcgacactt tgtttgctat gggtagaggt 300 

ttacaaagaa aggccactga aactgaaaag gctgtcgacg tcactgcttt ggaaatggtt 360 

aaatggttcg actctaacta ccattacgtt agaccaactt tctccaagac cactcaattt 420 

aagttgaacg gccaaaagcc agttgacgaa tttttggaag ccaaggagtt aggtattcat 480 

actagacctg tcttgttggg tccagtttct tacttattct tgggtaaggc tgacaaggat 540 

tctctagatt tggaaccatt gtccctattg gaacaattgt tgcctctata cactgaaatc 600 

ctatctaaat tggcttctgc tggtgccact gaagttcaaa ttgacgaacc tgtcttagtt 660 

ttggacttgc ctgccaacgc ccaagccgcc attaagaagg cttacactta cttcggtgaa 720 

caaagcaatc taccaaagat tactttggct acttacttcg gtaccgttgt ccctaactta 780 

gacgccatca agggcttgcc agttgctgcc ttacacgttg actttgttag agctccagaa 840 

caatttgatg aagtcgttgc cgccattggt aacaaacaaa ccttgtccgt tggtattgtt 900 

gatggtagaa acatttggaa gaatgatttc aagaagtctt ccgctatcgt taacaaggct 960 

attgaaaagt tgggtgctga cagagtcgtt gttgccactt cttcttctct attgcacaca 1020 

ccagttgatt tgaacaacga aaccaagttg gacgctgaaa tcaagggctt tttctctttc 1080 

gccactcaaa aattggatga agttgttgtg atcaccaaga acgtttccgg tcaagacgtt 1140 

gctgctgccc tagaagctaa cgctaaatct gttgaatcca gaggtaaatc caagtttatc 1200 

cacgatgctg ccgttaaggc cagagttgcc tctatcgacg aaaaaatgtc tactagagca 1260

gctccatttg aacaaagatt gcctgaacaa caaaaagtct tcaacttgcc attgttccca 1320

acaacaacta ttggttcctt ccctcaaacc aaggacatca gaattaacag aaacaaattc 1380

aacaagggca ccatctctgc tgaagaatat gaaaaattca tcaattctga aattgaaaag 1440

gtcatcagat tccaagaaga aattggtttg gatgtcttag tccacggtga accagaaaga 1500

aacgatatgg ttcaatactt cggtgaacaa atcaacggtt atgctttcac tgttaacggt 1560

tgggttcaat cttacggttc cagatatgtc agaccaccaa ttattgttgg tgacttgtcc 1620

agaccaaagg ctatgtccgt caaggaatct gtttacgctc aatccatcac ttctaagcca 1680

gtaaagggta tgttgactgg tccaattacc tgtttgagat ggtctttccc aagagacgat 1740

gtcgaccaaa aaactcaagc tatgcaatta gctttggctt tgagagatga agtcaatgat 1800

ttggaagctg ccggtatcaa ggttatccaa gttgatgaac cagctttaag agaaggttta 1860

ccattgagag aaggtactga gagatctgct tactacacct gggctgccga agctttcaga 1920

gttgctactt ctggtgttgc taacaagact caaatacact ctcatttctg ttactctgac 1980

ttggatccaa accatatcaa ggctttggat gctgatgttg tttccatcga attctctaag 2040

aaggacgatg ctaactacat tgctgaattc aaaaactatc caaaccacat tggtctaggt 2100

ttattcgata ttcattctcc aagaattcca tcaaaggatg aatttatcgc caagatttca 2160

accatcttga agagctaccc agctgaaaag ttctgggtta acccagattg tggtttgaag 2220

actagaggct gggaagaaac tagattgtct ttgactcata tggtcgaagc cgccaagtac 2280

ttccgtgaac aatacaagaa ttaa 2304

<210> 33
<211> 577
<212> PRT
<213> Saccharomyces cerevisiae
<400> 33

Met Ala Asp Ile Thr Asp Lys Thr Ala Glu Gln Leu Glu Asn Leu Asn
1 5 10 15

Ile Gln Asp Asp Gln Lys Gln Ala Ala Thr Gly Ser Glu Ser Gln Ser
20 25 30

Val Glu Asn Ser Ser Ala Ser Leu Tyr Val Gly Asp Leu Glu Pro Ser
35 40 45

Val Ser Glu Ala His Leu Tyr Asp Ile Phe Ser Pro Ile Gly Ser Val
50 55 60

Ser Ser Ile Arg Val Cys Arg Asp Ala Ile Thr Lys Thr Ser Leu Gly
65 70 75 80

Tyr Ala Tyr Val Asn Phe Asn Asp His Glu Ala Gly Arg Lys Ala Ile
85 90 95

Glu Gln Leu Asn Tyr Thr Pro Ile Lys Gly Arg Leu Cys Arg Ile Met
100 105 110

Trp Ser Gln Arg Asp Pro Ser Leu Arg Lys Lys Gly Ser Gly Asn Ile
115 120 125

Phe Ile Lys Asn Leu His Pro Asp Ile Asp Asn Lys Ala Leu Tyr Asp
130 135 140

Thr Phe Ser Val Phe Gly Asp Ile Leu Ser Ser Lys Ile Ala Thr Asp
145 150 155 160

Glu Asn Gly Lys Ser Lys Gly Phe Gly Phe Val His Phe Glu Glu Glu
165 170 175

Gly Ala Ala Lys Glu Ala Ile Asp Ala Leu Asn Gly Met Leu Leu Asn
180 185 190

Gly Gln Glu Ile Tyr Val Ala Pro His Leu Ser Arg Lys Glu Arg Asp
195 200 205

Ser Gln Leu Glu Glu Thr Lys Ala His Tyr Thr Asn Leu Tyr Val Lys
210 215 220

Asn Ile Asn Ser Glu Thr Thr Asp Glu Gln Phe Gln Glu Leu Phe Ala
225 230 235 240

Lys Phe Gly Pro Ile Val Ser Ala Ser Leu Glu Lys Asp Ala Asp Gly
245 250 255

Lys Leu Lys Gly Phe Gly Phe Val Asn Tyr Glu Lys His Glu Asp Ala
260 265 270

Val Lys Ala Val Glu Ala Leu Asn Asp Ser Glu Leu Asn Gly Glu Lys
275 280 285

Leu Tyr Val Gly Arg Ala Gln Lys Lys Asn Glu Arg Met His Val Leu
290 295 300

Lys Lys Gln Tyr Glu Ala Tyr Arg Leu Glu Lys Met Ala Lys Tyr Gln
305 310 315 320

Gly Val Asn Leu Phe Val Lys Asn Leu Asp Asp Ser Val Asp Asp Glu
325 330 335

Lys Leu Glu Glu Glu Phe Ala Pro Tyr Gly Thr Ile Thr Ser Ala Lys
340 345 350

Val Met Arg Thr Glu Asn Gly Lys Ser Lys Gly Phe Gly Phe Val Cys
355 360 365

Phe Ser Thr Pro Glu Glu Ala Thr Lys Ala Ile Thr Glu Lys Asn Gln
370 375 380

Gln Ile Val Ala Gly Lys Pro Leu Tyr Val Ala Ile Ala Gln Arg Lys
385 390 395 400

Asp Val Arg Arg Ser Gln Leu Ala Gln Gln Ile Gln Ala Arg Asn Gln
405 410 415

Met Arg Tyr Gln Gln Ala Thr Ala Ala Ala Ala Ala Ala Ala Ala Gly
420 425 430

Met Pro Gly Gln Phe Met Pro Pro Met Phe Tyr Gly Val Met Pro Pro
435 440 445

Arg Gly Val Pro Phe Asn Gly Pro Asn Pro Gln Gln Met Asn Pro Met
450 455 460

Gly Gly Met Pro Lys Asn Gly Met Pro Pro Gln Phe Arg Asn Gly Pro
465 470 475 480

Val Tyr Gly Val Pro Pro Gln Gly Gly Phe Pro Arg Asn Ala Asn Asp
485 490 495

Asn Asn Gln Phe Tyr Gln Gln Lys Gln Arg Gln Ala Leu Gly Glu Gln
500 505 510

Leu Tyr Lys Lys Val Ser Ala Lys Thr Ser Asn Glu Glu Ala Ala Gly
515 520 525

Lys Ile Thr Gly Met Ile Leu Asp Leu Pro Pro Gln Glu Val Phe Pro
530 535 540

Leu Leu Glu Ser Asp Glu Leu Phe Glu Gln His Tyr Lys Glu Ala Ser
545 550 555 560

Ala Ala Tyr Glu Ser Phe Lys Lys Glu Gln Glu Gln Gln Thr Glu Gln
565 570 575

Ala



<210> 34 

<211> 1734 

<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 34 



atggctgata ttactgataa gacagctgaa caattggaaa acttgaatat tcaagatgac 60

caaaagcaag ccgccactgg ttcagaaagc caatctgttg aaaactcttc tgcatcatta 120

tatgttggtg acttagaacc ttctgtttcc gaagcccact tatatgatat cttctctcca 180

atcggttcag tctcctccat tcgtgtctgt cgtgatgcca tcactaagac ctctttgggc 240

tatgcttatg ttaactttaa cgaccatgaa gccggcagaa aagcaattga gcaattgaac 300

tacactccaa tcaagggtag attatgccgt attatgtggt ctcaacgtga cccatcattg 360

agaaagaagg gttctggtaa catctttatc aagaacttgc accctgatat tgacaacaag 420

gctttgtatg acactttctc tgtgtttggt gacatcttgt ccagcaagat tgccaccgac 480

gaaaacggaa aatccaaggg ttttgggttt gttcacttcg aagaagaagg tgctgccaag 540

gaagctattg atgctttgaa tggtatgctg ttgaacggtc aagaaattta tgttgctcct 600

cacttgtcca gaaaggaacg tgactctcaa ttggaagaga ctaaggcaca ttacactaac 660

ctttatgtga aaaacatcaa ctccgaaact actgacgaac aattccaaga attgtttgcc 720

aaatttggtc caattgtttc tgcctctttg gaaaaggatg ctgatggaaa attgaagggt 780

ttcgggtttg ttaactacga aaagcatgaa gacgctgtga aagctgttga agctttgaat 840

gactctgaac taaatggaga aaagttatac gttggtcgtg cccaaaagaa gaatgaacgt 900

atgcatgtct tgaagaagca atacgaagct tacagattgg aaaaaatggc caagtaccaa 960

ggtgttaatt tgtttgttaa gaacttagat gacagcgttg atgacgaaaa gttggaagaa 1020

gaatttgctc catatggtac tatcacttct gcaaaggtta tgagaaccga aaacggtaag 1080

tctaagggtt ttggttttgt ttgtttctca actccagagg aagctactaa ggccattaca 1140

gaaaagaacc aacaaattgt tgctggtaag ccattatacg ttgccattgc tcaaagaaaa 1200

gacgtaagac gttctcaatt ggctcaacaa atccaagcca gaaatcaaat gagataccag 1260

caagctactg ctgccgctgc cgccgccgct gccggtatgc caggtcaatt catgcctcca 1320

atgttctatg gtgttatgcc accaagaggt gttccattca acggtccaaa cccacaacaa 1380

atgaacccaa tgggcggtat gccaaagaac ggcatgccac ctcaatttag aaatggtccg 1440

gtttacggcg tccccccaca aggtggtttc ccaagaaatg ccaacgataa caaccaattt 1500

tatcaacaaa agcaaagaca agctttgggt gaacaattat acaagaaggt ttctgctaag 1560

acttcaaatg aagaagcagc tggtaaaatt actggtatga ttttggattt gccacctcaa 1620

gaggtcttcc cattgttgga aagtgatgaa ttgttcgaac aacactacaa agaagcttct 1680

gctgcctatg agtctttcaa aaaggagcaa gaacaacaaa ctgagcaagc ttaa 1734

<210> 35
<211> 568
<212> PRT
<213> Saccharomyces cerevisiae
<400> 35

Met Ser Ser Gln Lys Val Phe Gly Ile Thr Gly Pro Val Ser Thr Val
1 5 10 15

Gly Ala Thr Ala Ala Glu Asn Lys Leu Asn Asp Ser Leu Ile Gln Glu
20 25 30

Leu Lys Lys Glu Gly Ser Phe Glu Thr Glu Gln Glu Thr Ala Asn Arg
35 40 45

Val Gln Val Leu Lys Ile Leu Gln Glu Leu Ala Gln Arg Phe Val Tyr
50 55 60

Glu Val Ser Lys Lys Lys Asn Met Ser Asp Gly Met Ala Arg Asp Ala
65 70 75 80

Gly Gly Lys Ile Phe Thr Tyr Gly Ser Tyr Arg Leu Gly Val His Gly
85 90 95

Pro Gly Ser Asp Ile Asp Thr Leu Val Val Val Pro Lys His Val Thr
100 105 110

Arg Glu Asp Phe Phe Thr Val Phe Asp Ser Leu Leu Arg Glu Arg Lys
115 120 125

Glu Leu Asp Glu Ile Ala Pro Val Pro Asp Ala Phe Val Pro Ile Ile
130 135 140

Lys Ile Lys Phe Ser Gly Ile Ser Ile Asp Leu Ile Cys Ala Arg Leu
145 150 155 160

Asp Gln Pro Gln Val Pro Leu Ser Leu Thr Leu Ser Asp Lys Asn Leu
165 170 175

Leu Arg Asn Leu Asp Glu Lys Asp Leu Arg Ala Leu Asn Gly Thr Arg
180 185 190

Val Thr Asp Glu Ile Leu Glu Leu Val Pro Lys Pro Asn Val Phe Arg
195 200 205
Ile Ala Leu Arg Ala Ile Lys Leu Trp Ala Gln Arg Arg Ala Val Tyr
210 215 220

Ala Asn Ile Phe Gly Phe Pro Gly Gly Val Ala Trp Ala Met Leu Val
225 230 235 240

Ala Arg Ile Cys Gln Leu Tyr Pro Asn Ala Cys Ser Ala Val Ile Leu
245 250 255

Asn Arg Phe Phe Ile Ile Leu Ser Glu Trp Asn Trp Pro Gln Pro Val
260 265 270

Ile Leu Lys Pro Ile Glu Asp Gly Pro Leu Gln Val Arg Val Trp Asn
275 280 285

Pro Lys Ile Tyr Ala Gln Asp Arg Ser His Arg Met Pro Val Ile Thr
290 295 300

Pro Ala Tyr Pro Ser Met Cys Ala Thr His Asn Ile Thr Glu Ser Thr
305 310 315 320

Lys Lys Val Ile Leu Gln Glu Phe Val Arg Gly Val Gln Ile Thr Asn
325 330 335

Asp Ile Phe Ser Asn Lys Lys Ser Trp Ala Asn Leu Phe Glu Lys Asn
340 345 350

Asp Phe Phe Phe Arg Tyr Lys Phe Tyr Leu Glu Ile Thr Ala Tyr Thr
355 360 365

Arg Gly Ser Asp Glu Gln His Leu Lys Trp Ser Gly Leu Val Glu Ser
370 375 380

Lys Val Arg Leu Leu Val Met Lys Leu Glu Val Leu Ala Gly Ile Lys
385 390 395 400

Ile Ala His Pro Phe Thr Lys Pro Phe Glu Ser Ser Tyr Cys Cys Pro
405 410 415

Thr Glu Asp Asp Tyr Glu Met Ile Gln Asp Lys Tyr Gly Ser His Lys
420 425 430

Thr Glu Thr Ala Leu Asn Ala Leu Lys Leu Val Thr Asp Glu Asn Lys
435 440 445

Glu Glu Glu Ser Ile Lys Asp Ala Pro Lys Ala Tyr Leu Ser Thr Met
450 455 460



Tyr Ile Gly Leu Asp Phe Asn Ile Glu Asn Lys Lys Glu Lys Val Asp
465 470 475 480

Ile His Ile Pro Cys Thr Glu Phe Val Asn Leu Cys Arg Ser Phe Asn
485 490 495

Glu Asp Tyr Gly Asp His Lys Val Phe Asn Leu Ala Leu Arg Phe Val
500 505 510

Lys Gly Tyr Asp Leu Pro Asp Glu Val Phe Asp Glu Asn Glu Lys Arg
515 520 525

Pro Ser Lys Lys Ser Lys Arg Lys Asn Leu Asp Ala Arg His Glu Thr
530 535 540

Val Lys Arg Ser Lys Ser Asp Ala Ala Ser Gly Asp Asn Ile Asn Gly
545 550 555 560

Thr Thr Ala Ala Val Asp Val Asn
565

<210> 36
<211> 1707
<212> DNA
<213> Saccharomyces cerevisiae
<400> 36

atgagctctc aaaaggtttt tggtattact ggacctgttt ccaccgtggg cgccacagca 60 

gcagaaaata aattaaatga tagtttaatc caagaactga aaaaggaagg atcgttcgaa 120 

acagagcaag aaactgccaa tagggtacaa gtgttgaaaa tattgcagga attggcacaa 180 

agatttgttt atgaagtatc gaagaagaaa aatatgtcag acgggatggc aagggatgct 240 

ggtgggaaga tttttacgta tgggtcctat agactaggag tccatgggcc tggtagtgat 300 

atcgatactt tggtagttgt tccaaaacat gtaactcggg aagatttttt tacggtattt 360 

gattcactac tgagagagag gaaggaactg gatgaaattg cacctgtacc tgatgcgttt 420 

gtcccgatta tcaagataaa gttcagtggt atttctatcg atttaatctg tgcacgtcta 480 

gaccaacctc aagtgccttt atccttgact ttatcagata aaaatctact gcgaaatcta 540 

gacgagaagg acttgagagc tttgaatggt accagagtaa cagatgagat attagaactg 600 

gtaccaaagc cgaatgtttt cagaatcgct ttaagagcta ttaagctatg ggcccaaaga 660 

agggctgttt atgctaatat ttttggtttt cctggtggtg tggcttgggc catgctagtg 720

gctagaattt gtcaactata ccctaacgcc tgtagcgcag ttatattgaa cagatttttc 780

atcattttgt cggaatggaa ttggccacaa cctgttatct tgaaaccaat tgaggatggc 840

ccgttacaag ttcgtgtatg gaatccaaag atatatgccc aagacaggtc tcatagaatg 900

cccgtcatta caccagctta tccatcaatg tgtgctaccc ataacatcac ggaatctact 960

aaaaaagtca ttttacagga attcgtaaga ggcgttcaaa ttacgaatga tattttttcc 1020

aataagaagt cctgggccaa tttattcgaa aaaaacgatt ttttctttcg atacaagttc 1080

tatttagaaa ttactgcata tacaaggggc agtgacgagc agcatttaaa atggagtggt 1140

cttgttgaaa gtaaggtaag gcttctagtt atgaaactgg aggtgttagc tggaataaaa 1200

attgcacatc ctttcaccaa accctttgaa agtagttatt gttgtccaac cgaggatgac 1260

tatgaaatga ttcaagacaa atacggtagt cataaaactg agacagcact gaacgccctt 1320

aaactggtaa cagatgaaaa taaagaggaa gaaagtatta aagatgcacc aaaggcatat 1380

ttaagcacca tgtacatagg ccttgacttt aatattgaaa acaaaaagga aaaagttgac 1440

attcacattc cctgcactga atttgtgaat ttatgtcgaa gtttcaatga ggattatggt 1500

gaccacaaag tattcaatct agccctccgc ttcgtaaagg gttacgattt gccagatgaa 1560

gttttcgatg aaaatgaaaa gagaccatca aagaagagta aaaggaagaa tttagatgct 1620

agacatgaaa ccgtgaagag atctaaatca gatgctgctt caggtgacaa catcaatggc 1680

acaaccgcag ctgttgacgt aaactaa 1707

<210> 37
<211> 626
<212> PRT
<213> Saccharomyces cerevisiae
<400> 37

Met Asp His Asp Thr Glu Val Ile Val Lys Asp Phe Asn Ser Ile Leu
1 5 10 15

Glu Glu Leu Thr Phe Asn Ser Arg Pro Ile Ile Thr Thr Leu Thr Lys
20 25 30

Leu Ala Glu Glu Asn Ile Ser Cys Ala Gln Tyr Phe Val Asp Ala Ile
35 40 45

Glu Ser Arg Ile Glu Lys Cys Met Pro Lys Gln Lys Leu Tyr Ala Phe
50 55 60

Tyr Ala Leu Asp Ser Ile Cys Lys Asn Val Gly Ser Pro Tyr Thr Ile
65 70 75 80

Tyr Phe Ser Arg Asn Leu Phe Asn Leu Tyr Lys Arg Thr Tyr Leu Leu
85 90 95

Val Asp Asn Thr Thr Arg Thr Lys Leu Ile Asn Met Phe Lys Leu Trp
100 105 110

Leu Asn Pro Asn Asp Thr Gly Leu Pro Leu Phe Glu Gly Ser Ala Leu
115 120 125

Glu Lys Ile Glu Gln Phe Leu Ile Lys Ala Ser Ala Leu His Gln Lys
130 135 140

Asn Leu Gln Ala Met Leu Pro Thr Pro Thr Val Pro Leu Leu Leu Arg
145 150 155 160

Asp Ile Asp Lys Leu Thr Cys Leu Thr Ser Glu Arg Leu Lys Asn Gln
165 170 175

Pro Asn Asp Glu Lys Leu Lys Met Lys Leu Leu Val Leu Ser Gln Leu
180 185 190

Lys Gln Glu Leu Lys Arg Glu Lys Leu Thr Leu Asn Ala Leu Lys Gln
195 200 205

Val Gln Met Gln Leu Arg Gln Val Phe Ser Gln Asp Gln Gln Val Leu
210 215 220

Gln Glu Arg Met Arg Tyr His Glu Leu Gln Gln Gln Gln Gln Gln Gln
225 230 235 240

Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Gln Tyr His Glu
245 250 255

Thr Lys Asp Met Val Gly Ser Tyr Thr Gln Asn Ser Asn Ser Ala Ile
260 265 270

Pro Leu Phe Gly Asn Asn Ser Asp Thr Thr Asn Gln Gln Asn Ser Leu
275 280 285

Ser Ser Ser Leu Phe Gly Asn Ile Ser Gly Val Glu Ser Phe Gln Glu
290 295 300

Ile Glu Lys Lys Lys Ser Leu Asn Lys Ile Asn Asn Leu Tyr Ala Ser
305 310 315 320

Leu Lys Ala Glu Gly Leu Ile Tyr Thr Pro Pro Lys Glu Ser Ile Val
325 330 335

Thr Leu Tyr Lys Lys Leu Asn Gly His Ser Asn Tyr Ser Leu Asp Ser
340 345 350

His Glu Lys Gln Leu Met Lys Asn Leu Pro Lys Ile Pro Leu Leu Asn
355 360 365



Asp Ile Leu Ser Asp Cys Lys Ala Tyr Phe Ala Thr Val Asn Ile Asp
370 375 380

Val Leu Asn Asn Pro Ser Leu Gln Leu Ser Glu Gln Thr Leu Leu Gln
385 390 395 400

Glu Asn Pro Ile Val Gln Asn Asn Leu Ile His Leu Leu Tyr Arg Ser
405 410 415

Lys Pro Asn Lys Cys Ser Val Cys Gly Lys Arg Phe Gly Asn Ser Glu
420 425 430

Ser Glu Lys Leu Leu Gln Asn Glu His Leu Asp Trp His Phe Arg Ile
435 440 445

Asn Thr Arg Ile Lys Gly Ser Gln Asn Thr Ala Asn Thr Gly Ile Ser
450 455 460

Asn Ser Asn Leu Asn Thr Thr Thr Thr Arg Lys Asn Ile Gln Ser Arg
465 470 475 480

Asn Trp Tyr Leu Ser Asp Ser Gln Trp Ala Ala Phe Lys Asp Asp Glu
485 490 495

Ile Thr Ser Thr Lys His Lys Asn Asp Tyr Thr Asp Pro His Ala Asn
500 505 510

Lys Asn Ile Asp Lys Ser Ala Leu Asn Ile His Ala Asp Glu Asn Asp
515 520 525

Glu Gly Ser Val Asp Asn Thr Leu Gly Ser Asp Arg Ser Asn Glu Leu
530 535 540

Glu Ile Arg Gly Lys Tyr Val Val Val Pro Glu Thr Ser Gln Asp Met
545 550 555 560

Ala Phe Lys Cys Pro Ile Cys Lys Glu Thr Val Thr Gly Val Tyr Asp
565 570 575

Glu Glu Ser Gly Glu Trp Val Trp Lys Asn Thr Ile Glu Val Asn Gly
580 585 590

Lys Tyr Phe His Ser Thr Cys Tyr His Glu Thr Ser Gln Asn Ser Ser
595 600 605

Lys Ser Asn Ser Gly Lys Val Gly Leu Asp Asp Leu Lys Lys Leu Val
610 615 620

Thr Lys
625

<210> 38
<211> 1881
<212> DNA
<213> Saccharomyces cerevisiae
<400> 38

atggatcacg acacagaagt tatagtcaag gatttcaaca gtattcttga agaattaact 60

tttaactcga gacccattat taccactctc acgaagttag cagaagaaaa catttcgtgt 120

gctcaatatt ttgttgacgc cattgagagt agaattgaaa aatgtatgcc taagcagaag 180

ctatacgctt tttatgcctt ggactccatt tgtaagaatg ttggcagccc atacacaatc 240

tacttcagta gaaatttgtt taacttgtac aaaagaacat acttattggt ggacaacacc 300

acaagaacaa aactcatcaa tatgtttaag ctttggttga atcctaatga caccggcctg 360

cctctgtttg aaggctcagc actggagaaa attgaacagt ttttaataaa ggccagtgct 420

ctccaccaga aaaacctgca agcaatgtta ccaactccga ctgttcccct cctattaaga 480

gacattgata aattaacttg tttaactagt gaaaggctaa agaatcaacc aaatgatgaa 540

aaattaaaaa tgaaactgtt ggttttatct cagttgaaac aagaactgaa aagagaaaaa 600

ctaactctca atgctctgaa acaagttcaa atgcaactaa ggcaggtctt ctcacaggac 660

caacaagttt tgcaggaaag aatgaggtat cacgaacttc agcagcaaca gcaacaacag 720

cagcaacagc aacaacaaca acagcagcag cagcagcagt atcacgaaac taaggatatg 780

gtcggttctt acacacaaaa ctctaattca gcgattccat tgtttggaaa taattcagat 840

acaacgaacc aacagaattc tttatcttcc tctctttttg gtaatatttc tggcgttgaa 900

tcttttcaag aaattgagaa aaaaaagtcc ctaaacaaaa tcaataacct ctatgcatct 960

cttaaggctg aagggctcat atatactcca ccaaaagaat ccatcgtaac gctttataaa 1020

aagctaaatg gacactcaaa ctactctctt gattcacacg aaaaacaatt gatgaagaac 1080

ctgccaaaga tacccttatt aaatgatatt ctttctgact gcaaagctta ttttgccacg 1140

gttaacatcg acgttttgaa taacccatcc ctgcagctgt ccgaacaaac tcttcttcaa 1200

gaaaatccca tcgtccaaaa taatcttatt cacttattat accgatccaa gccaaataaa 1260

tgtagcgtgt gtggtaagag atttggcaac tcagaaagtg aaaaactact acaaaatgag 1320

catttggact ggcatttcag aataaataca agaattaagg gatcacaaaa cacagcaaat 1380







acaggcataa gcaactctaa cttaaataca actaccacaa ggaaaaatat tcaatcaaga 1440 
aactggtatt tgagtgactc tcaatgggca gccttcaaag atgatgaaat tacctccaca 1500 
aagcataaaa atgattatac tgatccgcat gccaataaaa atatcgataa aagtgcccta 1560 
aatattcacg ctgatgaaaa tgacgaaggc tctgtagaca acactttagg aagtgataga 1620 
agtaacgagt tggaaatacg gggaaaatat gttgttgtac cagaaacttc acaagatatg 1680 
gccttcaaat gtccaatttg taaggaaacc gttactggcg tttatgatga agaatctggt 1740 
gaatgggttt ggaaaaatac tattgaagtg aatggcaaat atttccactc gacatgttat 1800 
catgaaacgt cgcaaaattc aagcaaatct aatagtggca aggtcggttt ggatgactta 1860 
aagaaattgg tcacaaaata a 1881 



<210> 39
<211> 563
<212> PRT
<213> Saccharomyces cerevisiae
<400> 39

Met Ser Glu Ile Thr Leu Gly Lys Tyr Leu Phe Glu Arg Leu Lys Gln
1 5 10 15

Val Asn Val Asn Thr Val Phe Gly Leu Pro Gly Asp Phe Asn Leu Ser
20 25 30

Leu Leu Asp Lys Ile Tyr Glu Val Glu Gly Met Arg Trp Ala Gly Asn
35 40 45

Ala Asn Glu Leu Asn Ala Ala Tyr Ala Ala Asp Gly Tyr Ala Arg Ile
50 55 60

Lys Gly Met Ser Cys Ile Ile Thr Thr Phe Gly Val Gly Glu Leu Ser
65 70 75 80

Ala Leu Asn Gly Ile Ala Gly Ser Tyr Ala Glu His Val Gly Val Leu
85 90 95

His Val Val Gly Val Pro Ser Ile Ser Ala Gln Ala Lys Gln Leu Leu
100 105 110

Leu His His Thr Leu Gly Asn Gly Asp Phe Thr Val Phe His Arg Met
115 120 125

Ser Ala Asn Ile Ser Glu Thr Thr Ala Met Ile Thr Asp Ile Ala Thr
130 135 140

Ala Pro Ala Glu Ile Asp Arg Cys Ile Arg Thr Thr Tyr Val Thr Gln
145 150 155 160

Arg Pro Val Tyr Leu Gly Leu Pro Ala Asn Leu Val Asp Leu Asn Val
165 170 175

Pro Ala Lys Leu Leu Gln Thr Pro Ile Asp Met Ser Leu Lys Pro Asn
180 185 190

Asp Ala Glu Ser Glu Lys Glu Val Ile Asp Thr Ile Leu Ala Leu Val
195 200 205

Lys Asp Ala Lys Asn Pro Val Ile Leu Ala Asp Ala Cys Cys Ser Arg
210 215 220

His Asp Val Lys Ala Glu Thr Lys Lys Leu Ile Asp Leu Thr Gln Phe
225 230 235 240

Pro Ala Phe Val Thr Pro Met Gly Lys Gly Ser Ile Asp Glu Gln His
245 250 255

Pro Arg Tyr Gly Gly Val Tyr Val Gly Thr Leu Ser Lys Pro Glu Val
260 265 270

Lys Glu Ala Val Glu Ser Ala Asp Leu Ile Leu Ser Val Gly Ala Leu
275 280 285

Leu Ser Asp Phe Asn Thr Gly Ser Phe Ser Tyr Ser Tyr Lys Thr Lys
290 295 300

Asn Ile Val Glu Phe His Ser Asp His Met Lys Ile Arg Asn Ala Thr
305 310 315 320

Phe Pro Gly Val Gln Met Lys Phe Val Leu Gln Lys Leu Leu Thr Thr
325 330 335

Ile Ala Asp Ala Ala Lys Gly Tyr Lys Pro Val Ala Val Pro Ala Arg
340 345 350

Thr Pro Ala Asn Ala Ala Val Pro Ala Ser Thr Pro Leu Lys Gln Glu
355 360 365

Trp Met Trp Asn Gln Leu Gly Asn Phe Leu Gln Glu Gly Asp Val Val
370 375 380

Ile Ala Glu Thr Gly Thr Ser Ala Phe Gly Ile Asn Gln Thr Thr Phe
385 390 395 400

Pro Asn Asn Thr Tyr Gly Ile Ser Gln Val Leu Trp Gly Ser Ile Gly
405 410 415

Phe Thr Thr Gly Ala Thr Leu Gly Ala Ala Phe Ala Ala Glu Glu Ile
420 425 430

Asp Pro Lys Lys Arg Val Ile Leu Phe Ile Gly Asp Gly Ser Leu Gln
435 440 445

Leu Thr Val Gln Glu Ile Ser Thr Met Ile Arg Trp Gly Leu Lys Pro
450 455 460

Tyr Leu Phe Val Leu Asn Asn Asp Gly Tyr Thr Ile Glu Lys Leu Ile
465 470 475 480

His Gly Pro Lys Ala Gln Tyr Asn Glu Ile Gln Gly Trp Asp His Leu
485 490 495

Ser Leu Leu Pro Thr Phe Gly Ala Lys Asp Tyr Glu Thr His Arg Val
500 505 510

Ala Thr Thr Gly Glu Trp Asp Lys Leu Thr Gln Asp Lys Ser Phe Asn
515 520 525

Asp Asn Ser Lys Ile Arg Met Ile Glu Ile Met Leu Pro Val Phe Asp
530 535 540

Ala Pro Gln Asn Leu Val Glu Gln Ala Lys Leu Thr Ala Ala Thr Asn
545 550 555 560

Ala Lys Gln



<210> 40 

<211> 1692 
<212> DNA 

<213> Saccharomyces cerevisiae
<400> 40

atgtctgaaa ttactttggg taaatatttg ttcgaaagat taaagcaagt caacgttaac 60
accgttttcg gtttgccagg tgacttcaac ttgtccttgt tggacaagat ctacgaagtt 120
gaaggtatga gatgggctgg taacgccaac gaattgaacg ctgcttacgc cgctgatggt 180
tacgctcgta tcaagggtat gtcttgtatc atcaccacct tcggtgtcgg tgaattgtct 240
gctttgaacg gtattgccgg ttcttacgct gaacacgtcg gtgttttgca cgttgttggt 300
gtcccatcca tctctgctca agctaagcaa ttgttgttgc accacacctt gggtaacggt 360
gacttcactg ttttccacag aatgtctgcc aacatttctg aaaccactgc tatgatcact 420
gacattgcta ccgccccagc tgaaattgac agatgtatca gaaccactta cgtcacccaa 480
agaccagtct acttaggttt gccagctaac ttggtcgact tgaacgtccc agctaagttg 540
ttgcaaactc caattgacat gtctttgaag ccaaacgatg ctgaatccga aaaggaagtc 600
attgacacca tcttggcttt ggtcaaggat gctaagaacc cagttatctt ggctgatgct 660
tgttgttcca gacacgacgt caaggctgaa actaagaagt tgattgactt gactcaattc 720
ccagctttcg tcaccccaat gggtaagggt tccattgacg aacaacaccc aagatacggt 780
ggtgtttacg tcggtacctt gtccaagcca gaagttaagg aagccgttga atctgctgac 840
ttgattttgt ctgtcggtgc tttgttgtct gatttcaaca ccggttcttt ctcttactct 900
tacaagacca agaacattgt cgaattccac tccgaccaca tgaagatcag aaacgccact 960
ttcccaggtg tccaaatgaa attcgttttg caaaagttgt tgaccactat tgctgacgcc 1020
gctaagggtt acaagccagt tgctgtccca gctagaactc cagctaacgc tgctgtccca 1080
gcttctaccc cattgaagca agaatggatg tggaaccaat tgggtaactt cttgcaagaa 1140
ggtgatgttg tcattgctga aaccggtacc tccgctttcg gtatcaacca aaccactttc 1200
ccaaacaaca cctacggtat ctctcaagtc ttatggggtt ccattggttt caccactggt 1260
gctaccttgg gtgctgcttt cgctgctgaa gaaattgatc caaagaagag agttatctta 1320
ttcattggtg acggttcttt gcaattgact gttcaagaaa tctccaccat gatcagatgg 1380
ggcttgaagc catacttgtt cgtcttgaac aacgatggtt acaccattga aaagttgatt 1440
cacggtccaa aggctcaata caacgaaatt caaggttggg accacctatc cttgttgcca 1500
actttcggtg ctaaggacta tgaaacccac agagtcgcta ccaccggtga atgggacaag 1560
ttgacccaag acaagtcttt caacgacaac tctaagatca gaatgattga aatcatgttg 1620
ccagtcttcg atgctccaca aaacttggtt gaacaagcta agttgactgc tgctaccaac 1680
gctaagcaat aa 1692



<210> 41 

<211> 987 

<212> PRT 

<213> Saccharomyces cerevisiae
<400> 41

Met Gln Ser Gln Asp Ser Cys Tyr Gly Val Ala Phe Arg Ser Ile Ile
1 5 10 15

Thr Asn Asp Glu Ala Leu Phe Lys Lys Thr Ile His Phe Tyr His Thr
20 25 30

Leu Gly Phe Ala Thr Val Lys Asp Phe Asn Lys Phe Lys His Gly Glu
35 40 45

Asn Ser Leu Leu Ser Ser Gly Thr Ser Gln Asp Ser Leu Arg Glu Val
50 55 60

Trp Leu Glu Ser Phe Lys Leu Ser Glu Val Asp Ala Ser Gly Phe Arg
65 70 75 80

Ile Pro Gln Gln Glu Ala Thr Asn Lys Ala Gln Ser Gln Gly Ala Leu
85 90 95

Leu Lys Ile Arg Leu Val Met Ser Ala Pro Ile Asp Glu Thr Phe Asp
100 105 110

Thr Asn Glu Thr Ala Thr Ile Thr Tyr Phe Ser Thr Asp Leu Asn Lys
115 120 125

Ile Val Glu Lys Phe Pro Lys Gln Ala Glu Lys Leu Ser Asp Thr Leu
130 135 140

Val Phe Leu Lys Asp Pro Met Gly Asn Asn Ile Thr Phe Ser Gly Leu
145 150 155 160

Ala Asn Ala Thr Asp Ser Ala Pro Thr Ser Lys Asp Ala Phe Leu Glu
165 170 175

Ala Thr Ser Glu Asp Glu Ile Ile Ser Arg Ala Ser Ser Asp Ala Ser
180 185 190

Asp Leu Leu Arg Gln Thr Leu Gly Ser Ser Gln Lys Lys Lys Lys Ile
195 200 205

Ala Val Met Thr Ser Gly Gly Asp Ser Pro Gly Met Asn Ala Ala Val
210 215 220

Arg Ala Val Val Arg Thr Gly Ile His Phe Gly Cys Asp Val Phe Ala
225 230 235 240

Val Tyr Glu Gly Tyr Glu Gly Leu Leu Arg Gly Gly Lys Tyr Leu Lys
245 250 255

Lys Met Ala Trp Glu Asp Val Arg Gly Trp Leu Ser Glu Gly Gly Thr
260 265 270

Leu Ile Gly Thr Ala Arg Ser Met Glu Phe Arg Lys Arg Glu Gly Arg
275 280 285

Arg Gln Ala Ala Gly Asn Leu Ile Ser Gln Gly Ile Asp Ala Leu Val
290 295 300

Val Cys Gly Gly Asp Gly Ser Leu Thr Gly Ala Asp Leu Phe Arg His
305 310 315 320

Glu Trp Pro Ser Leu Val Asp Glu Leu Val Ala Glu Gly Arg Phe Thr
325 330 335

Lys Glu Glu Val Ala Pro Tyr Lys Asn Leu Ser Ile Val Gly Leu Val
340 345 350

Gly Ser Ile Asp Asn Asp Met Ser Gly Thr Asp Ser Thr Ile Gly Ala
355 360 365

Tyr Ser Ala Leu Glu Arg Ile Cys Glu Met Val Asp Tyr Ile Asp Ala
370 375 380

Thr Ala Lys Ser His Ser Arg Ala Phe Val Val Glu Val Met Gly Arg
385 390 395 400

His Cys Gly Trp Leu Ala Leu Met Ala Gly Ile Ala Thr Gly Ala Asp
405 410 415

Tyr Ile Phe Ile Pro Glu Arg Ala Val Pro His Gly Lys Trp Gln Asp
420 425 430

Glu Leu Lys Glu Val Cys Gln Arg His Arg Ser Lys Gly Arg Arg Asn
435 440 445

Asn Thr Ile Ile Val Ala Glu Gly Ala Leu Asp Asp Gln Leu Asn Pro
450 455 460

Val Thr Ala Asn Asp Val Lys Asp Ala Leu Ile Glu Leu Gly Leu Asp
465 470 475 480

Thr Lys Val Thr Ile Leu Gly His Val Gln Arg Gly Gly Thr Ala Val
485 490 495

Ala His Asp Arg Trp Leu Ala Thr Leu Gln Gly Val Asp Ala Val Lys
500 505 510

Ala Val Leu Glu Phe Thr Pro Glu Thr Pro Ser Pro Leu Ile Gly Ile
515 520 525

Leu Glu Asn Lys Ile Ile Arg Met Pro Leu Val Glu Ser Val Lys Leu
530 535 540

Thr Lys Ser Val Ala Thr Ala Ile Glu Asn Lys Asp Phe Asp Lys Ala
545 550 555 560

Ile Ser Leu Arg Asp Thr Glu Phe Ile Glu Leu Tyr Glu Asn Phe Leu
565 570 575

Ser Thr Thr Val Lys Asp Asp Gly Ser Glu Leu Leu Pro Val Ser Asp
580 585 590

Arg Leu Asn Ile Gly Ile Val His Val Gly Ala Pro Ser Ala Ala Leu
595 600 605

Asn Ala Ala Thr Arg Ala Ala Thr Leu Tyr Cys Leu Ser His Gly His
610 615 620

Lys Pro Tyr Ala Ile Met Asn Gly Phe Ser Gly Leu Ile Gln Thr Gly
625 630 635 640

Glu Val Lys Glu Leu Ser Trp Ile Asp Val Glu Asn Trp His Asn Leu
645 650 655

Gly Gly Ser Glu Ile Gly Thr Asn Arg Ser Val Ala Ser Glu Asp Leu
660 665 670

Gly Thr Ile Ala Tyr Tyr Phe Gln Lys Asn Lys Leu Asp Gly Leu Ile
675 680 685

Ile Leu Gly Gly Phe Glu Gly Phe Arg Ser Leu Lys Gln Leu Arg Asp
690 695 700

Gly Arg Thr Gln His Pro Ile Phe Asn Ile Pro Met Cys Leu Ile Pro
705 710 715 720

Ala Thr Val Ser Asn Asn Val Pro Gly Thr Glu Tyr Ser Leu Gly Val
725 730 735

Asp Thr Cys Leu Asn Ala Leu Val Asn Tyr Thr Asp Asp Ile Lys Gln
740 745 750

Ser Ala Ser Ala Thr Arg Arg Arg Val Phe Val Cys Glu Val Gln Gly
755 760 765

Gly His Ser Gly Tyr Ile Ala Ser Phe Thr Gly Leu Ile Thr Gly Ala
770 775 780

Val Ser Val Tyr Thr Pro Glu Lys Lys Ile Asp Leu Ala Ser Ile Arg
785 790 795 800

Glu Asp Ile Thr Leu Leu Lys Glu Asn Phe Arg His Asp Lys Gly Glu
805 810 815

Asn Arg Asn Gly Lys Leu Leu Val Arg Asn Glu Gln Ala Ser Ser Val
820 825 830

Tyr Ser Thr Gln Leu Leu Ala Asp Ile Ile Ser Glu Ala Ser Lys Gly
835 840 845

Lys Phe Gly Val Arg Thr Ala Ile Pro Gly His Val Gln Gln Gly Gly
850 855 860

Val Pro Ser Ser Lys Asp Arg Val Thr Ala Ser Arg Phe Ala Val Lys
865 870 875 880

Cys Ile Lys Phe Ile Glu Gln Trp Asn Lys Lys Asn Glu Ala Ser Pro
885 890 895

Asn Thr Asp Ala Lys Val Leu Arg Phe Lys Phe Asp Thr His Gly Glu
900 905 910

Lys Val Pro Thr Val Glu His Glu Asp Asp Ser Ala Ala Val Ile Cys
915 920 925

Val Asn Gly Ser His Val Ser Phe Lys Pro Ile Ala Asn Leu Trp Glu
930 935 940

Asn Glu Thr Asn Val Glu Leu Arg Lys Gly Phe Glu Val His Trp Ala
945 950 955 960

Glu Tyr Asn Lys Ile Gly Asp Ile Leu Ser Gly Arg Leu Lys Leu Arg
965 970 975

Ala Glu Val Ala Ala Leu Ala Ala Glu Asn Lys
980 985



<210> 42 

<211> 2000
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 42 

gatcacagtt tttcgaggat agcactgatc tatacaataa aaggagctat tttcatcata 60 

atgaaagaga tgtatttgta ttcattttcg cacttgccac cattttacta attgtttgct 120 

catgttatgt tacgcctcta tatcgtatgc atcacaagat gggaacttgg aattggtcct 180 

cggcgttaga ttgcgccttc attggtgcct tctcaattga atttatcgtg aaaacagtag 240 







ctgacggatt tatatattct ccaaatgctt acctgaggaa tccatggaac tttattgatt 300
tttgtgtcct aatctcaatg tggattaatt taattgcata cctaaaaaac aatggaaatt 360
tgtctaggat tttcaaggga ttgacagccc tgagggccct cagatgcctc acgatcagta 420
acacagctcg tcaaacattt aacctagtta tgtttgatgg tttaaataaa atttttgaag 480
ctgggttgat ttcactcagt ttgctatttc catttacagt ttggggctta agcattttta 540
aaggccgttt aggtacttgc aatgacggaa gtttgggccg tgcagattgt tacaatgaat 600
attcaaattc cgtttttcaa tgggatatca tgtctccaag ggtttaccag caaccatatc 660
ttcatttgga ctctttcgca agcgctttta gttcattata ccaaatcatt tctttggaag 720
gatgggttga tttgttggaa aatatgatga atagttcagg aataggtaca cccgctacgg 780
taatgggttc agcagggaat gctttattcc tcgttctgtt taatttttta agtatggttt 840
tcatcctgaa cttgtttgtt tcattcattg ttaacaacca agcaaggaca acaggaagcg 900
cttactttac cattgaggaa aaggcgtggc tggaatccca gaaactttta tctcaggcca 960
agccaaaagc tatcccaaat ttaattgagt tatcaagagt taggcaattt ttctatcaac 1020
ttgcagtgga gaaaaaaaat ttctactacg catcgtttct tcaggtagta ctttatttgc 1080
acataatcat gctcctgagt cgaagctaca atccaggaaa cttgataggt tatcaaggtg 1140
tttattttat gttttccact agtgtttttt taattcaaga ggcacttcac atgtgcggtg 1200
aaggaccaag attatatttt aggcaaaaat ggaacagcat acgactcagt atcataatta 1260
tagcctttat tatgaacgct gtagcattcc acgttccagc ctctcactat tggttccaca 1320
atataaaggg gtttttcctg ttagtgatat ttttgtttat tattcctcaa aatgacacac 1380
taactgaact attagaaacc gcaatggcaa gcttaccgcc tattctatca ttgacctaca 1440
cttggggggt tttattttta gtatatgcta ttgctttgaa tcaaatcttc ggcctaacaa 1500
ggttagggag taatacgacc gataacataa attttagaac tgtaatcaaa tccatgattg 1560
ttctgtttag atgtagtttt ggtgagggct ggaattatat catggccgac ctaactgtgt 1620
cagaacctta ttgctcctct gatgataatt caacctatac ggactgtgga tcagagacat 1680
atgcctattt gttattaatg tcgtggaata ttatttccat gtatattttt gtgaatatgt 1740
ttgtttcgtt gattattggt aatttcagtt atgtttaccg tagcggtgga tctcgctctg 1800
gcatcaacag atcggagata aaaaaataca ttgaagcttg gtccaaattt gatactgatg 1860
gaactggcga gcttgagctg tcctacctcc caagaataat gcattcattt gacggtcctc 1920
tttcatttaa aatttgggaa ggtagattga caataaaaag tctagtcgag aactacatgg 1980
aggttaaccc agatgatcca 2000



<210> 43 

<211> 465 
<212> PRT
<213> Saccharomyces cerevisiae
<400> 43

Met Asp Gly His Asn Gln Asn Gln Tyr Gln Asn Gln Asn Gln Ile Gln
1 5 10 15

Gln Ser Gln Gln Pro Pro Leu Lys Lys Tyr Val Thr Gln Arg Arg Ser
20 25 30

Val Asp Val Ser Ser Pro Tyr Ile Asn Leu Tyr Tyr Asn Arg Arg His
35 40 45

Gly Leu Pro Asn Leu Val Val Glu Pro Glu Thr Ser Tyr Thr Ile Asp
50 55 60

Ile Met Pro Pro Asn Ala Tyr Arg Gly Arg Asp Arg Val Ile Asn Leu
65 70 75 80

Pro Ser Lys Phe Thr His Leu Ser Ser Asn Lys Val Lys His Val Ile
85 90 95

Pro Ala Ile Gln Trp Thr Pro Glu Gly Arg Arg Leu Val Val Ala Thr
100 105 110

Tyr Ser Gly Glu Phe Ser Leu Trp Asn Ala Ser Ser Phe Thr Phe Glu
115 120 125

Thr Leu Met Gln Ala His Asp Ser Ala Val Thr Thr Met Lys Tyr Ser
130 135 140

His Asp Ser Asp Trp Met Ile Ser Gly Asp Ala Asp Gly Met Ile Lys
145 150 155 160

Ile Trp Gln Pro Asn Phe Ser Met Val Lys Glu Ile Asp Ala Ala His
165 170 175

Thr Glu Ser Ile Arg Asp Met Ala Phe Ser Ser Asn Asp Ser Lys Phe
180 185 190

Val Thr Cys Ser Asp Asp Asn Ile Leu Lys Ile Trp Asn Phe Ser Asn
195 200 205

Gly Lys Gln Glu Arg Val Leu Ser Gly His His Trp Asp Val Lys Ser
210 215 220

Cys Asp Trp His Pro Glu Met Gly Leu Ile Ala Ser Ala Ser Lys Asp
225 230 235 240

Asn Leu Val Lys Leu Trp Asp Pro Arg Ser Gly Asn Cys Ile Ser Ser
245 250 255

Ile Leu Lys Phe Lys His Thr Val Leu Lys Thr Arg Phe Gln Pro Thr
260 265 270

Lys Gly Asn Leu Leu Met Ala Ile Ser Lys Asp Lys Ser Cys Arg Val
275 280 285

Phe Asp Ile Arg Tyr Ser Met Lys Glu Leu Met Cys Val Arg Asp Glu
290 295 300

Thr Asp Tyr Met Thr Leu Glu Trp His Pro Ile Asn Glu Ser Met Phe
305 310 315 320

Thr Leu Ala Cys Tyr Asp Gly Ser Leu Lys His Phe Asp Leu Leu Gln
325 330 335

Asn Leu Asn Glu Pro Ile Leu Thr Ile Pro Tyr Ala His Asp Lys Cys
340 345 350

Ile Thr Ser Leu Ser Tyr Asn Pro Val Gly His Ile Phe Ala Thr Ala
355 360 365

Ala Lys Asp Arg Thr Ile Arg Phe Trp Thr Arg Ala Arg Pro Ile Asp
370 375 380

Pro Asn Ala Tyr Asp Asp Pro Thr Tyr Asn Asn Lys Lys Ile Asn Gly
385 390 395 400

Trp Phe Phe Gly Ile Asn Asn Asp Ile Asn Ala Val Arg Glu Lys Ser
405 410 415

Glu Phe Gly Ala Ala Pro Pro Pro Pro Ala Thr Leu Glu Pro His Ala
420 425 430

Leu Pro Asn Met Asn Gly Phe Ile Asn Lys Lys Pro Arg Gln Glu Ile
435 440 445

Pro Gly Ile Asp Ser Asn Ile Lys Ser Ser Thr Leu Pro Gly Leu Ser
450 455 460

Ile
465



<210> 44
<211> 1398
<212> DNA
<213> Saccharomyces cerevisiae
<400> 44



atggacgggc ataatcagaa ccagtaccag aatcagaatc aaattcagca gtcacaacaa 60 

ccacctctga agaaatatgt gacacagaga cgatcggtag acgtgagttc tccgtatatt 120 

aacctatatt acaacagacg acatggatta ccgaacctag tggtagagcc agaaacttca 180 

tacacaatag atataatgcc gcctaatgcc tacagaggtc gagatcgagt cataaatttg 240 

cccagcaaat ttacgcattt aagctcgaat aaagtgaaac atgtgatacc cgccatccaa 300 

tggaccccag aaggcagaag acttgtagta gcaacttata gcggagaatt ttctctatgg 360 

aacgcatctt cgtttacgtt tgaaaccctc atgcaagcac atgattcggc tgtgactaca 420 

atgaaatatt ctcacgacag tgattggatg attagtgggg acgcagatgg aatgattaag 480 

atatggcaac caaactttag tatggttaaa gaaattgatg ctgcacacac ggaaagtatt 540 

agggatatgg cattcagtag taacgactcg aaatttgtta cctgttcaga tgataatatc 600 

ttgaagattt ggaacttcag caatggcaaa caagaaagag tattatcggg gcaccactgg 660 

gacgtgaaaa gttgtgattg gcatcccgag atggggttaa ttgcctccgc atctaaggac 720 

aatttagtaa aactgtggga tccgcgttcc ggaaactgca tatcatcaat tctgaagttc 780 

aaacatactg tcttgaagac gcgatttcaa cccaccaagg gaaatctatt aatggcgatt 840 

tcaaaagata aatcgtgtcg tgtgttcgac attagataca gtatgaagga actaatgtgt 900 

gtaagagatg aaacggatta tatgacacta gaatggcacc caataaatga atctatgttc 960 

actctggcgt gctatgacgg ctcgctaaag cattttgatc ttttacaaaa tctaaacgaa 1020 

cctattttga ctattccata cgcccatgat aagtgtatca cctcgttatc atataatcca 1080 

gtaggacata tatttgccac agcagctaag gatagaacca taagattttg gactagagcg 1140 

aggccaattg atcctaatgc atatgatgat ccaacttata acaataagaa gataaatgga 1200 

tggttttttg gcatcaataa cgacatcaat gctgtgagag agaagagtga attcggtgcg 1260 

gccccgccac caccagctac acttgaacca catgcgttac caaacatgaa tggattcatc 1320 

aacaagaaac cgcgtcaaga aatcccaggc atcgattcaa acatcaagag ttccacattg 1380 

ccaggtttaa gcatataa 1398

<210> 45

<211> 785

<212> PRT

<213> Saccharomyces cerevisiae
<400> 45

Met Ser Ser Ala Glu Met Glu Gln Leu Leu Gln Ala Lys Thr Leu Ala
1 5 10 15

Met His Asn Asn Pro Thr Glu Met Leu Pro Lys Val Leu Glu Thr Thr
20 25 30

Ala Ser Met Tyr His Asn Gly Asn Leu Ser Lys Leu Lys Leu Pro Leu
35 40 45

Ala Lys Phe Phe Thr Gln Leu Val Leu Asp Val Val Ser Met Asp Ser
50 55 60

Pro Ile Ala Asn Thr Glu Arg Pro Phe Ile Ala Ala Gln Tyr Leu Pro
65 70 75 80

Leu Leu Leu Ala Met Ala Gln Ser Thr Ala Asp Val Leu Val Tyr Lys
85 90 95

Asn Ile Val Leu Ile Met Cys Ala Ser Tyr Pro Leu Val Leu Asp Leu
100 105 110

Val Ala Lys Thr Ser Asn Gln Glu Met Phe Asp Gln Leu Cys Met Leu
115 120 125

Lys Lys Phe Val Leu Ser His Trp Arg Thr Ala Tyr Pro Leu Arg Ala
130 135 140

Thr Val Asp Asp Glu Thr Asp Val Glu Gln Trp Leu Ala Gln Ile Asp
145 150 155 160

Gln Asn Ile Gly Val Lys Leu Ala Thr Ile Lys Phe Ile Ser Glu Val
165 170 175

Val Leu Ser Gln Thr Lys Ser Pro Ser Gly Asn Glu Ile Asn Ser Ser
180 185 190

Thr Ile Pro Asp Asn His Pro Val Leu Asn Lys Pro Ala Leu Glu Ser
195 200 205

Glu Ala Lys Arg Leu Leu Asp Met Leu Leu Asn Tyr Leu Ile Glu Glu
210 215 220

Gln Tyr Met Val Ser Ser Val Phe Ile Gly Ile Ile Asn Ser Leu Ser
225 230 235 240

Phe Val Ile Lys Arg Arg Pro Gln Thr Thr Ile Arg Ile Leu Ser Gly
245 250 255

Leu Leu Arg Phe Asn Val Asp Ala Lys Phe Pro Leu Glu Gly Lys Ser
260 265 270

Asp Leu Asn Tyr Lys Leu Ser Lys Arg Phe Val Glu Arg Ala Tyr Lys
275 280 285

Asn Phe Val Gln Phe Gly Leu Lys Asn Gln Ile Ile Thr Lys Ser Leu
290 295 300

Ser Ser Gly Ser Gly Ser Ser Ile Tyr Ser Lys Leu Thr Lys Ile Ser
305 310 315 320

Gln Thr Leu His Val Ile Gly Glu Glu Thr Lys Ser Lys Gly Ile Leu
325 330 335

Asn Phe Asp Pro Ser Lys Gly Asn Ser Lys Lys Thr Leu Ser Arg Gln
340 345 350

Asp Lys Leu Lys Tyr Ile Ser Leu Trp Lys Arg Gln Leu Ser Ala Leu
355 360 365

Leu Ser Thr Leu Gly Val Ser Thr Lys Thr Pro Thr Pro Val Ser Ala
370 375 380

Pro Ala Thr Gly Ser Ser Thr Glu Asn Met Leu Asp Gln Leu Lys Ile
385 390 395 400

Leu Gln Lys Tyr Thr Leu Asn Lys Ala Ser His Gln Gly Asn Thr Phe
405 410 415

Phe Asn Asn Ser Pro Lys Pro Ile Ser Asn Thr Tyr Ser Ser Val Tyr
420 425 430

Ser Leu Met Asn Ser Ser Asn Ser Asn Gln Asp Val Thr Gln Leu Pro
435 440 445

Asn Asp Ile Leu Ile Lys Leu Ser Thr Glu Ala Ile Leu Gln Met Asp
450 455 460

Ser Thr Lys Leu Ile Thr Gly Leu Ser Ile Val Ala Ser Arg Tyr Thr
465 470 475 480

Asp Leu Met Asn Thr Tyr Ile Asn Ser Val Pro Ser Ser Ser Ser Ser
485 490 495

Lys Arg Lys Ser Asp Asp Asp Asp Asp Gly Asn Asp Asn Glu Glu Val
500 505 510

Gly Asn Asp Gly Pro Thr Ala Asn Ser Lys Lys Ile Lys Met Glu Thr
515 520 525

Glu Pro Leu Ala Glu Glu Pro Glu Glu Pro Glu Asp Asp Asp Arg Met
530 535 540

Gln Lys Met Leu Gln Glu Glu Glu Ser Ala Gln Glu Ile Ser Gly Asp
545 550 555 560

Ala Asn Lys Ser Thr Ser Ala Ile Lys Glu Ile Ala Pro Pro Phe Glu
565 570 575

Pro Asp Ser Leu Thr Gln Asp Glu Lys Leu Lys Tyr Leu Ser Lys Leu
580 585 590

Thr Lys Lys Leu Phe Glu Leu Ser Gly Arg Gln Asp Thr Thr Arg Ala
595 600 605

Lys Ser Ser Ser Ser Ser Ser Ile Leu Leu Asp Asp Asp Asp Ser Ser
610 615 620

Ser Trp Leu His Val Leu Ile Arg Leu Val Thr Arg Gly Ile Glu Ala
625 630 635 640

Gln Glu Ala Ser Asp Leu Ile Arg Glu Glu Leu Leu Gly Phe Phe Ile
645 650 655

Gln Asp Phe Glu Gln Arg Val Ser Leu Ile Ile Glu Trp Leu Asn Glu
660 665 670

Glu Trp Phe Phe Gln Thr Ser Leu His Gln Asp Pro Ser Asn Tyr Lys
675 680 685

Lys Trp Ser Leu Arg Val Leu Glu Ser Leu Gly Pro Phe Leu Glu Asn
690 695 700

Lys His Arg Arg Phe Phe Ile Arg Leu Met Ser Glu Leu Pro Ser Leu
705 710 715 720

Gln Ser Asp His Leu Glu Ala Leu Lys Pro Ile Cys Leu Asp Pro Ala
725 730 735

Arg Ser Ser Leu Gly Phe Gln Thr Leu Lys Phe Leu Ile Met Phe Arg
740 745 750

Pro Pro Val Gln Asp Thr Val Arg Asp Leu Leu His Gln Leu Lys Gln
755 760 765

Glu Asp Glu Gly Leu His Lys Gln Cys Asp Ser Leu Leu Asp Arg Leu
770 775 780

Lys
785

<210> 46 

< 211 > 2000 
<212> DNA 

<213 > Saccharomyces cerevisiae 
<400> 46 



atgtcatctg cagagatgga acaattgtta caggccaaga cactggccat gcacaacaat 60
ccaacggaga tgctgcccaa ggtgctcgaa actacggcat ccatgtacca caacggtaat 120
ctcagcaagc tgaagttgcc tttggccaag ttttttacac agttagttct agacgtggtg 180
tcgatggact ctccaattgc gaatactgag agaccgttta ttgctgctca atatctgcca 240
ctacttcttg ctatggcgca atccaccgcg gacgtactag tgtacaagaa tatcgtgctt 300
attatgtgcg cttcataccc gctggtgttg gatctggttg ctaagacatc aaaccaggaa 360
atgtttgatc agttgtgtat gctgaagaag ttcgtgctct cgcactggag aactgcatat 420
cctttgcgtg ccaccgttga cgatgaaacg gatgtcgaac aatggctggc gcagattgac 480
caaaatatcg gcgtgaaatt agcgaccatc aagttcatat ctgaggtcgt gctgtcgcaa 540
actaaatcac ccagcggcaa cgagattaat tcatctacca tcccggataa ccaccctgtg 600
ttgaacaaac cggctttgga gagcgaggct aagaggcttc ttgatatgtt gctaaactac 660
ctaattgagg aacagtacat ggtctcgtcc gttttcattg gtatcatcaa ttctttatcc 720
ttcgtcatca aaagaaggcc gcagacaaca ataagaattc tttccgggct gttgcgtttc 780
aacgtcgacg ccaagtttcc cctagagggc aagtctgact tgaactacaa actatccaag 840
agatttgttg aaagggcgta caagaacttt gtgcaatttg ggctaaaaaa tcaaatcatt 900
acaaaatccc tctcatccgg atcagggtca tcgatctact ccaagctgac caagatttct 960
caaactttac acgttattgg cgaagagacc aagagcaagg gaattttgaa cttcgaccct 1020
tccaagggca atagcaagaa aacgttgtcc aggcaggaca aactaaaata catctcacta 1080
tggaaaaggc aattatccgc gttattgtct actctagggg tgtccacaaa gacccccacg 1140
cctgtgtccg cacctgcaac gggctcttca accgaaaaca tgcttgatca actgaagata 1200
ttgcaaaaat acaccctcaa caaggcttca caccagggca atactttttt caacaactca 1260
cccaaaccaa tcagcaacac ctactcatct gtgtactcat tgatgaacag ttcgaactcc 1320
aaccaggatg tgacccagct acccaatgac atacttatca agctgtccac agaggccatc 1380
ttgcaaatgg acagcacgaa actgatcacc ggattgtcta tcgttgcttc gaggtacacg 1440
gatttaatga atacgtacat caattctgta ccgtcctcgt catcatcaaa gaggaaatcc 1500
gacgatgatg acgacggcaa cgacaatgaa gaagttggaa acgatggccc aacggctaat 1560 
agcaagaaaa tcaaaatgga aacagaacca ctagcggagg aaccagagga gcccgaagac 1620 
gatgaccgaa tgcagaagat gcttcaagaa gaggaaagcg cccaagaaat ctcaggagat 1680 
gccaacaaat caacttctgc cattaaggag atcgcacccc cctttgaacc tgactcattg 1740 
acgcaggatg aaaaactaaa gtacctctca aagctgacca agaaactgtt tgaattatcc 1800 
ggtcgccagg atactacccg ggccaaatct tcgtcttcct cctccatatt actggacgat 1860
gacgactcct cgtcatggtt acacgtctta atcagattgg ttacgagagg aatcgaagca 1920 
caagaggcca gtgacctgat tcgtgaagaa ctgcttggct tcttcatcca ggatttcgag 1980 
caacgtgtca gtctgatcat 2000

<210> 47

<211> 533

<212> PRT

<213> Saccharomyces cerevisiae
<400> 47 

Met Ser Ala Pro Val Pro Gln Leu Val Asn Ile Ser His Ala Leu Gln
1 5 10 15

Ala Ser Thr Ile Gln Gln Ile Arg Leu Asp Met Val Asp Phe Asn Lys
20 25 30

Asp Cys Lys Leu Ser Ser Ile Gln Leu Ala Arg Ile Asp Lys Tyr Ile
35 40 45

Asp Ser Leu Gln Ala Ala Leu Asn Gln Phe Thr Lys Asp Asn Leu His
50 55 60

Ile Glu Arg Lys Glu Lys Asn Val Thr Glu Ala Asp Ile Gln Leu Tyr
65 70 75 80

Ser Gly Leu Lys Ser Met Tyr Leu Asp Tyr Leu Asn Gln Leu Ile Lys
85 90 95

Leu Lys His Glu Lys Gln His His Ser Thr Pro Pro Ile Ala Asn Asp
100 105 110

Val Ser Leu Asp Phe Phe Val Asn Gln Leu Pro Lys Phe Ser Pro Glu
115 120 125

Glu Arg Lys Asn Tyr Ile Asp Asn Leu Ile Leu Asn Lys Asn Ser His
130 135 140

Asn Arg Leu Ser Lys Met Asp Gly Leu Val Asp Ala Val Ile Asn Leu
145 150 155 160

Cys Val Leu Asp Thr Ser Val Ala Glu Asn Val Arg Ser Tyr Met Lys
165 170 175

Leu Leu Asp Thr Leu Gly Phe Gln Lys Gly Ser Asn Ser Thr Gly Thr
180 185 190

Lys Ala Asn Leu Lys Lys Lys Leu Ala Ser Ser Lys Ala Lys Ile Lys
195 200 205

Asp Ser Glu Lys Glu Lys Glu Lys Glu Lys Asp Lys Ser Lys Val Lys
210 215 220

Met Lys Thr Lys Leu Lys Pro Ser Pro Leu Leu Asn Asn Asp Asp Lys
225 230 235 240

Asn Ser Ser Pro Ser Pro Thr Ala Ser Thr Ser Ser Met Lys Lys Leu
245 250 255

Lys Ser Gly Leu Phe Asn Lys Asn Glu Ala Lys Ser Thr Glu Ser Leu
260 265 270

Pro Thr Ser Ser Lys Lys Lys Leu Ser Phe Ser Lys Tyr Leu Asn Lys
275 280 285

Asp Asp Ala Asp Met Thr Lys Leu Gly Thr Lys Arg Ser Ile Asp Val
290 295 300

Asp Phe Lys Val Asn Pro Glu Ala Ser Thr Val Ala Ser Asn Ile Ile
305 310 315 320

Ser Ser Ser Thr Ser Gly Ser Ser Thr Thr Thr Val Ala Thr Pro Ala
325 330 335

Ser Ser Glu Glu Pro Leu Lys Lys Lys Thr Lys Ile Ser Val Gln Asp
340 345 350

Ser Asn Val Gln Ser Ile Leu Arg Asn Gly Lys Pro Lys Lys Ala Arg
355 360 365

Ile Ser Ser Ile Lys Phe Leu Asp Asp Ser Gln Leu Ile Lys Val Tyr
370 375 380

Gly Asp Asp Leu Pro Asn Gln Gly Leu Gln Val Ser Pro Thr Gln Leu
385 390 395 400

Lys Lys Ile Leu Lys Pro Phe Lys Glu Gly Glu Pro Lys Glu Ile Ile
405 410 415

Leu Phe Glu Asp Met Ser Ile Lys Leu Lys Pro Leu Asp Leu Met Phe
420 425 430

Leu Lys Asn Thr Asn Ser Asp Asp Tyr Met Asp Ile Ser Glu Thr Lys
435 440 445

Gly Gly Pro Ile His Cys Glu Thr Arg Thr Pro Leu Ile Tyr Arg Lys
450 455 460

Asn Phe Asn His Phe Asn Pro Asp Leu Asn Lys Arg Pro Pro Arg Glu
465 470 475 480

Pro Ile Glu Phe Asp Leu Asn Gly Asn Thr Asn Ser Thr Pro Thr Ile
485 490 495

Ala Lys Ala Phe Gly Lys Asn Ser Leu Leu Leu Arg Lys Asp Arg Gly
500 505 510

Gly Leu Pro Tyr Lys His Val Pro Ile Val Lys Arg Asn Lys Tyr Pro
515 520 525

Pro Arg Pro Val His
530

<210> 48

<211> 1602
<212> DNA

<213> Saccharomyces cerevisiae
<400> 48 

atgtcagcac ctgttcctca attagtaaat atttctcatg ccttgcaagc ttccactatc 60 

cagcaaattc gtttggatat ggtagatttt aataaagatt gcaaattatc ttccattcaa 120 

ctggcaagaa ttgacaagta tatcgattct ttgcaagcgg ctttgaatca gtttaccaaa 180 

gataacttgc acatagaacg gaaggaaaag aacgtgactg aagcagatat acagctgtat 240 

tctggcttaa aatcaatgta tctagattat ttgaaccagc taattaaact aaagcatgag 300 

aaacaacacc actctacacc gcccatcgct aatgatgtct ccctggactt cttcgttaac 360 

caattgccta agttttctcc cgaggaaagg aaaaattaca ttgacaattt aatcctgaat 420 

aaaaatagcc ataatcgatt atctaaaatg gatggtctgg tagatgcagt catcaattta 480 

tgcgtcttgg atacttccgt tgctgaaaat gtacgatcct atatgaaatt attagataca 540 

ttgggtttcc aaaagggttc gaatagtact ggtacaaagg ccaacctcaa aaagaaattg 600

gctagttcga aagcaaagat aaaggattca gaaaaagaaa aggagaagga gaaggataaa 660

tcaaaagtca agatgaaaac taaattaaaa ccttctcctt tgctcaataa cgatgacaaa 720

aattcttctc catcgcctac tgcatctacc tcttcaatga agaaattgaa gtcgggttta 780

ttcaataaaa atgaagctaa gtctacagaa tctctaccta cttcttccaa gaaaaaacta 840

tcattttcta aatatctgaa caaggatgac gcagatatga ccaagcttgg gactaaacgg 900

tcaatagatg tggatttcaa agtcaacccc gaagcatcca cggtggcttc taatatcata 960

tcttcgtcaa cgtcaggatc gtcaaccaca acggtagcga ctcctgcttc ttcagaagag 1020

cccttaaaaa aaaaaaccaa aatatccgtg caagactcta atgtacaatc gattttgaga 1080

aatggtaaac cgaaaaaagc acgcataagt agcatcaaat ttttggatga ttcccaacta 1140

ataaaagttt acggtgacga tctaccgaac caagggctac aagtttctcc tactcaattg 1200

aaaaaaattc tgaaaccatt caaggagggg gaaccgaagg aaattatatt gttcgaggat 1260

atgtcaatca aattaaaacc tcttgatttg atgtttctga agaacacaaa cagtgatgac 1320

tatatggata tatccgagac taaaggtggc ccaatacatt gtgaaacaag gaccccgttg 1380

atctatagaa aaaatttcaa tcatttcaac ccggacttga ataaaaggcc gccaagagaa 1440

cccatagaat tcgacttaaa tggaaatacg aactcaaccc cgactatagc aaaggctttc 1500

ggtaaaaata gtttattact aaggaaggac agaggtggtt tgccatacaa gcatgtcccc 1560

atagtaaaaa gaaataaata tcctccaaga ccagtacact aa 1602



<210> 49 

<211> 677 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 49 

Met Ser Ser Ser Thr Thr Pro Asp Leu Leu Tyr Pro Ser Ala Asp Lys
1 5 10 15

Val Ala Glu Pro Ser Asp Asn Ile His Gly Asp Glu Leu Arg Leu Arg
20 25 30

Glu Arg Ile Lys Asp Asn Pro Thr Asn Ile Leu Ser Tyr Phe Gln Leu
35 40 45

Ile Gln Tyr Leu Glu Thr Gln Glu Ser Tyr Ala Lys Val Arg Glu Val
50 55 60

Tyr Glu Gln Phe His Asn Thr Phe Pro Phe Tyr Ser Pro Ala Trp Thr
65 70 75 80

Leu Gln Leu Lys Gly Glu Leu Ala Arg Asp Glu Phe Glu Thr Val Glu
85 90 95

Lys Ile Leu Ala Gln Cys Leu Ser Gly Lys Leu Glu Asn Asn Asp Leu
100 105 110

Ser Leu Trp Ser Thr Tyr Leu Asp Tyr Ile Arg Arg Lys Asn Asn Leu
115 120 125

Ile Thr Gly Gly Gln Glu Ala Arg Ala Val Ile Val Lys Ala Phe Gln
130 135 140

Leu Val Met Gln Lys Cys Ala Ile Phe Glu Pro Lys Ser Ser Ser Phe
145 150 155 160

Trp Asn Glu Tyr Leu Asn Phe Leu Glu Gln Trp Lys Pro Phe Asn Lys
165 170 175

Trp Glu Glu Gln Gln Arg Ile Asp Met Leu Arg Glu Phe Tyr Lys Lys
180 185 190

Met Leu Cys Val Pro Phe Asp Asn Leu Glu Lys Met Trp Asn Arg Tyr
195 200 205

Thr Gln Trp Glu Gln Glu Ile Asn Ser Leu Thr Ala Arg Lys Phe Ile
210 215 220

Gly Glu Leu Ser Ala Glu Tyr Met Lys Ala Arg Ser Leu Tyr Gln Glu
225 230 235 240

Trp Leu Asn Val Thr Asn Gly Leu Lys Arg Ala Ser Pro Ile Asn Leu
245 250 255

Arg Thr Ala Asn Lys Lys Asn Ile Pro Gln Pro Gly Thr Ser Asp Ser
260 265 270

Asn Ile Gln Gln Leu Gln Ile Trp Leu Asn Trp Ile Lys Trp Glu Arg
275 280 285

Glu Asn Lys Leu Met Leu Ser Glu Asp Met Leu Ser Gln Arg Ile Ser
290 295 300

Tyr Val Tyr Lys Gln Gly Ile Gln Tyr Met Ile Phe Ser Ala Glu Met
305 310 315 320

Trp Tyr Asp Tyr Ser Met Tyr Ile Ser Glu Asn Ser Asp Arg Gln Asn
325 330 335

Ile Leu Tyr Thr Ala Leu Leu Ala Asn Pro Asp Ser Pro Ser Leu Thr
340 345 350

Phe Lys Leu Ser Glu Cys Tyr Glu Leu Asp Asn Asp Ser Glu Ser Val
355 360 365

Ser Asn Cys Phe Asp Lys Cys Thr Gln Thr Leu Leu Ser Gln Tyr Lys
370 375 380

Lys Ile Ala Ser Asp Val Asn Ser Gly Glu Asp Asn Asn Thr Glu Tyr
385 390 395 400

Glu Gln Glu Leu Leu Tyr Lys Gln Arg Glu Lys Leu Thr Phe Val Phe
405 410 415

Cys Val Tyr Met Asn Thr Met Lys Arg Ile Ser Gly Leu Ser Ala Ala
420 425 430

Arg Thr Val Phe Gly Lys Cys Arg Lys Leu Lys Arg Ile Leu Thr His
435 440 445

Asp Val Tyr Val Glu Asn Ala Tyr Leu Glu Phe Gln Asn Gln Asn Asp
450 455 460

Tyr Lys Thr Ala Phe Lys Val Leu Glu Leu Gly Leu Lys Tyr Phe Gln
465 470 475 480

Asn Asp Gly Val Tyr Ile Asn Lys Tyr Leu Asp Phe Leu Ile Phe Leu
485 490 495

Asn Lys Asp Ser Gln Ile Lys Thr Leu Phe Glu Thr Ser Val Glu Lys
500 505 510

Val Gln Asp Leu Thr Gln Leu Lys Glu Ile Tyr Lys Lys Met Ile Ser
515 520 525

Tyr Glu Ser Lys Phe Gly Asn Leu Asn Asn Val Tyr Ser Leu Glu Lys
530 535 540

Arg Phe Phe Glu Arg Phe Pro Gln Glu Asn Leu Ile Glu Val Phe Thr
545 550 555 560

Ser Arg Tyr Gln Ile Gln Asn Ser Asn Leu Ile Lys Lys Leu Glu Leu
565 570 575

Thr Tyr Met Tyr Asn Glu Glu Glu Asp Ser Tyr Phe Ser Ser Gly Asn
580 585 590

Gly Asp Gly His His Gly Ser Tyr Asn Met Ser Ser Ser Asp Arg Lys
595 600 605

Arg Leu Met Glu Glu Thr Gly Asn Asn Gly Asn Phe Ser Asn Lys Lys 
610 615 620 

Phe Lys Arg Asp Ser Glu Leu Pro Thr Glu Val Leu Asp Leu Leu Ser
625 630 635 640

Val Ile Pro Lys Arg Gln Tyr Phe Asn Thr Asn Leu Leu Asp Ala Gln
645 650 655

Lys Leu Val Asn Phe Leu Asn Asp Gln Val Glu Ile Pro Thr Val Glu
660 665 670

Ser Thr Lys Ser Gly 
675 

<210> 50 

<211> 2000 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 50 



atgtccagct ctacgactcc tgatttacta tatccctctg cggacaaagt cgcagagcct 60
agtgacaata tacatggaga tgaactacga cttagagaaa ggattaaaga caatcccacg 120
aatattttat catacttcca gcttattcaa tatttggaaa ctcaagagtc atacgctaag 180
gtgagagaag tatacgagca atttcataac acattcccgt tttattcacc tgcgtggact 240
ttgcaactaa agggtgaatt ggcaagagat gaatttgaga ctgttgagaa gattttggct 300
caatgtcttt ctggcaagtt ggaaaataat gacctatctc tttggtcaac atatttggac 360
tacatacgca gaaaaaacaa cttaattact ggtggacaag aggcgagagc tgttattgtc 420
aaggcattcc aactagttat gcaaaagtgt gcaatttttg aacccaaatc atcttctttt 480
tggaacgaat atctcaattt tttagagcag tggaagccat tcaacaaatg ggaggagcaa 540
cagcgaattg acatgctcag agaattctac aagaaaatgc tatgtgttcc ttttgataat 600
ctagaaaaaa tgtggaatag atacactcaa tgggaacaag aaataaattc cctaacagcc 660
agaaaattta ttggcgagtt atcagccgaa tacatgaaag cccgttcctt ataccaggaa 720
tggttgaacg ttactaatgg attgaaaagg gcatctccaa ttaatctgcg cacagcaaac 780
aagaaaaaca taccacaacc aggtacctca gactcaaaca ttcagcagtt acagatttgg 840
ttgaattgga taaaatggga aagggagaat aagttgatgc ttagtgaaga tatgctatca 900
caaagaatca gttacgttta taaacaaggt attcaataca tgatattttc tgctgaaatg 960
tggtacgatt attcaatgta tatatctgaa aattcggatc gacaaaatat cttatatact 1020
gcgttattag ctaatcccga ctcaccttct cttacattca agttatccga atgctacgaa 1080
ctggataatg attctgaaag tgtttctaac tgttttgaca agtgcactca aactttacta 1140
tcgcagtata aaaagatcgc ctccgatgta aattcgggtg aagataataa cacagagtat 1200
gaacaagagc tgctatacaa acagagggaa aaattaacat tcgtgttttg cgtgtatatg 1260
aatacgatga aaagaatatc aggactatcc gcagcacgta ctgtatttgg taaatgtcgt 1320
aaactgaagc gtatattaac acatgacgtc tacgtggaaa atgcatattt agaatttcaa 1380
aatcaaaacg attataagac tgcttttaag gttttagaat tgggtttaaa atacttccaa 1440
aacgatggag tttatatcaa caaatactta gattttttaa tatttttaaa taaggattcg 1500
cagatcaaaa ccttatttga aacatcagtg gaaaaagtgc aagatttaac ccagctgaag 1560
gaaatataca agaaaatgat aagttatgaa tcgaaattcg gtaacttaaa caacgtttat 1620
tctctagaga aaagattttt cgaacggttc ccccaagaaa atttgattga agttttcaca 1680
agtcgttatc aaattcaaaa ctccaactta ataaagaaat tagagttaac ttatatgtat 1740
aatgaggaag aagacagtta cttttcttct ggaaacgggg atggccatca tggctcttac 1800
aatatgagtt cgtcagatag aaagagacta atggaggaaa ctggaaacaa tggaaacttc 1860
tccaataaga aattcaaaag agactcagag cttccaacag aggttcttga tttattgagc 1920
gttataccaa aacgtcaata ttttaataca aatttactcg atgcgcagaa attggtgaat 1980
tttttaaatg atcaagtaga 2000



<210> 51 

<211> 296 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 51 

Met Asn Arg Gln Ser Gly Val Asn Ala Gly Val Gln Asn Asn Pro Pro
1 5 10 15

Ser Arg Val Val Tyr Leu Gly Ser Ile Pro Tyr Asp Gln Thr Glu Glu
20 25 30

Gln Ile Leu Asp Leu Cys Ser Asn Val Gly Pro Val Ile Asn Leu Lys
35 40 45

Met Met Phe Asp Pro Gln Thr Gly Arg Ser Lys Gly Tyr Ala Phe Ile
50 55 60

Glu Phe Arg Asp Leu Glu Ser Ser Ala Ser Ala Val Arg Asn Leu Asn
65 70 75 80

Gly Tyr Gln Leu Gly Ser Arg Phe Leu Lys Cys Gly Tyr Ser Ser Asn
85 90 95

Ser Asp Ile Ser Gly Val Ser Gln Gln Gln Gln Gln Gln Tyr Asn Asn
100 105 110

Ile Asn Gly Asn Asn Asn Asn Asn Gly Asn Asn Asn Asn Asn Ser Asn
115 120 125

Gly Pro Asp Phe Gln Asn Ser Gly Asn Ala Asn Phe Leu Ser Gln Lys
130 135 140

Phe Pro Glu Leu Pro Ser Gly Ile Asp Val Asn Ile Asn Met Thr Thr
145 150 155 160

Pro Ala Met Met Ile Ser Ser Glu Leu Ala Lys Lys Pro Lys Glu Val
165 170 175

Gln Leu Lys Phe Leu Gln Lys Phe Gln Glu Trp Thr Arg Ala His Pro
180 185 190

Glu Asp Ala Val Ser Leu Leu Glu Leu Cys Pro Gln Leu Ser Phe Val
195 200 205

Thr Ala Glu Leu Leu Leu Thr Asn Gly Ile Cys Lys Val Asp Asp Leu
210 215 220

Ile Pro Leu Ala Ser Arg Pro Gln Glu Glu Ala Ser Ala Thr Asn Asn
225 230 235 240

Asn Ser Val Asn Glu Val Val Asp Pro Ala Val Leu Asn Lys Gln Lys
245 250 255

Glu Leu Leu Lys Gln Val Leu Gln Leu Asn Asp Ser Gln Ile Ser Ile
260 265 270

Leu Pro Asp Asp Glu Arg Met Ala Ile Trp Asp Leu Lys Gln Lys Ala
275 280 285

Leu Arg Gly Glu Phe Gly Ala Phe
290 295

<210> 52

<211> 891

<212> DNA

<213> Saccharomyces cerevisiae
<400> 52

atgaataggc agagcggtgt gaatgcgggg gtacagaaca acccaccatc ccgagtggtg 60 

tatctgggtt ctataccata cgatcaaaca gaggagcaga tacttgattt atgtagtaat 120 

gttgggcccg tgatcaattt gaaaatgatg ttcgaccccc aaactggtag gtcgaaaggg 180 

tacgcgttta ttgaatttag agatttagag tccagtgcca gcgcagtacg taatttgaat 240 

ggataccaat taggctctag gtttttgaaa tgcggttact ccagcaatag tgatatatcg 300 

ggagtttcac aacagcaaca acaacagtac aacaacatta atgggaacaa taacaacaat 360 

ggaaataata ataataatag taatgggccg gactttcaaa acagcggaaa tgccaatttt 420 

ctaagtcaaa agtttccaga attgccctct ggtatcgacg ttaacataaa catgaccacc 480 

cctgctatga tgatatcgag cgaactagct aaaaaaccga aagaggtgca gttgaaattt 540 

ttacaaaaat tccaagaatg gacaagagcg catcctgaag atgctgtttc gctattagag 600 

ctgtgtccac agttgagttt tgttacggct gaattattgc taacgaatgg gatatgtaaa 660 

gtggatgatt tgatcccgtt agcttccagg ccgcaagaag aggcatcggc tacgaataac 720 

aatagcgtga acgaggtggt ggatccagct gtgcttaaca aacagaaaga actactgaaa 780 

caggtgttac aactgaatga cagtcaaatt tctatcttgc ccgatgatga aaggatggct 840 

atttgggact taaaacaaaa agcattaagg ggagaatttg gtgcattttg a 891

<210> 53

<211> 297

<212> PRT

<213> Saccharomyces cerevisiae
<400> 53

Met Val Val Ile Ala Asn Ala His Asn Glu Leu Ile His Asp Ala Val
1 5 10 15

Leu Asp Tyr Tyr Gly Lys Arg Leu Ala Thr Cys Ser Ser Asp Lys Thr
20 25 30

Ile Lys Ile Phe Glu Val Glu Gly Glu Thr His Lys Leu Ile Asp Thr
35 40 45

Leu Thr Gly His Glu Gly Pro Val Trp Arg Val Asp Trp Ala His Pro
50 55 60

Lys Phe Gly Thr Ile Leu Ala Ser Cys Ser Tyr Asp Gly Lys Val Leu
65 70 75 80

Ile Trp Lys Glu Glu Asn Gly Arg Trp Ser Gln Ile Ala Val His Ala
85 90 95

Val His Ser Ala Ser Val Asn Ser Val Gln Trp Ala Pro His Glu Tyr
100 105 110

Gly Pro Leu Leu Leu Val Ala Ser Ser Asp Gly Lys Val Ser Val Val
115 120 125

Glu Phe Lys Glu Asn Gly Thr Thr Ser Pro Ile Ile Ile Asp Ala His
130 135 140

Ala Ile Gly Val Asn Ser Ala Ser Trp Ala Pro Ala Thr Ile Glu Glu
145 150 155 160

Asp Gly Glu His Asn Gly Thr Lys Glu Ser Arg Lys Phe Val Thr Gly
165 170 175

Gly Ala Asp Asn Leu Val Lys Ile Trp Lys Tyr Asn Ser Asp Ala Gln
180 185 190

Thr Tyr Val Leu Glu Ser Thr Leu Glu Gly His Ser Asp Trp Val Arg
195 200 205

Asp Val Ala Trp Ser Pro Thr Val Leu Leu Arg Ser Tyr Leu Ala Ser
210 215 220

Val Ser Gln Asp Arg Thr Cys Ile Ile Trp Thr Gln Asp Asn Glu Gln
225 230 235 240

Gly Pro Trp Lys Lys Thr Leu Leu Lys Glu Glu Lys Phe Pro Asp Val
245 250 255

Leu Trp Arg Ala Ser Trp Ser Leu Ser Gly Asn Val Leu Ala Leu Ser
260 265 270

Gly Gly Asp Asn Lys Val Thr Leu Trp Lys Glu Asn Leu Glu Gly Lys
275 280 285

Trp Glu Pro Ala Gly Glu Val His Gln
290 295

<210> 54

<211> 894

<212> DNA

<213> Saccharomyces cerevisiae
<400> 54 

atggtcgtca tagctaatgc gcacaacgaa ttaatccatg acgctgttct agactattat 60

gggaagcgcc ttgcaacctg ctcttctgac aagacaatca agatctttga agtcgaagga 120

gaaacacaca agttaataga cacgttgact ggccacgaag gcccagtttg gcgtgttgat 180

tgggcacatc ctaaattcgg aaccattttg gcatcgtgtt cttatgatgg taaagtgttg 240

atttggaagg aagaaaacgg tagatggtct caaattgccg ttcatgctgt ccactctgct 300

tctgtcaact ctgttcaatg ggctcctcat gaatatggcc ccctactgct ggttgcttcc 360

tctgatggta aggtctccgt agtagagttc aaagaaaacg gtactacttc cccaataatc 420

atcgatgctc atgccattgg cgttaactct gcttcttggg ctccagctac catcgaagaa 480

gatggtgaac acaacggtac taaagaatct cgcaagtttg ttactggggg tgctgacaat 540

ttggtaaaga tttggaagta caattcagat gcccaaactt atgttctgga aagcacctta 600

gaaggtcaca gcgattgggt tagagacgta gcatggtcac ctactgttct tctacgttct 660

tatttggcca gtgtttctca agatcgcacc tgtattattt ggactcaaga caatgaacaa 720

ggcccatgga aaaaaacttt attaaaagaa gaaaaattcc cagatgtttt atggagagcc 780

agttggtctt tgtcaggtaa tgtactagct ctttccggtg gcgataataa agttacttta 840

tggaaggaaa atcttgaggg taaatgggaa cccgctggtg aagttcatca gtga 894

<210> 55

<211> 1273

<212> PRT

<213> Saccharomyces cerevisiae
<400> 55 

Met Val Lys Leu Ala Glu Phe Ser Arg Thr Ala Thr Phe Ala Trp Ser
1 5 10 15

His Asp Lys Ile Pro Leu Leu Val Ser Gly Thr Val Ser Gly Thr Val
20 25 30

Asp Ala Asn Phe Ser Thr Asp Ser Ser Leu Glu Leu Trp Ser Leu Leu
35 40 45

Ala Ala Asp Ser Glu Lys Pro Ile Ala Ser Leu Gln Val Asp Ser Lys
50 55 60

Phe Asn Asp Leu Asp Trp Ser His Asn Asn Lys Ile Ile Ala Gly Ala
65 70 75 80

Leu Asp Asn Gly Ser Leu Glu Leu Tyr Ser Thr Asn Glu Ala Asn Asn
85 90 95

Ala Ile Asn Ser Met Ala Arg Phe Ser Asn His Ser Ser Ser Val Lys
100 105 110

Thr Val Lys Phe Asn Ala Lys Gln Asp Asn Val Leu Ala Ser Gly Gly
115 120 125

Asn Asn Gly Glu Ile Phe Ile Trp Asp Met Asn Lys Cys Thr Glu Ser
130 135 140

Pro Ser Asn Tyr Thr Pro Leu Thr Pro Gly Gln Ser Met Ser Ser Val
145 150 155 160

Asp Glu Val Ile Ser Leu Ala Trp Asn Gln Ser Leu Ala His Val Phe
165 170 175

Ala Ser Ala Gly Ser Ser Asn Phe Ala Ser Ile Trp Asp Leu Lys Ala
180 185 190

Lys Lys Glu Val Ile His Leu Ser Tyr Thr Ser Pro Asn Ser Gly Ile
195 200 205

Lys Gln Gln Leu Ser Val Val Glu Trp His Pro Lys Asn Ser Thr Arg
210 215 220

Val Ala Thr Ala Thr Gly Ser Asp Asn Asp Pro Ser Ile Leu Ile Trp
225 230 235 240

Asp Leu Arg Asn Ala Asn Thr Pro Leu Gln Thr Leu Asn Gln Gly His
245 250 255

Gln Lys Gly Ile Leu Ser Leu Asp Trp Cys His Gln Asp Glu His Leu
260 265 270

Leu Leu Ser Ser Gly Arg Asp Asn Thr Val Leu Leu Trp Asn Pro Glu
275 280 285

Ser Ala Glu Gln Leu Ser Gln Phe Pro Ala Arg Gly Asn Trp Cys Phe
290 295 300

Lys Thr Lys Phe Ala Pro Glu Ala Pro Asp Leu Phe Ala Cys Ala Ser
305 310 315 320

Phe Asp Asn Lys Ile Glu Val Gln Thr Leu Gln Asn Leu Thr Asn Thr
325 330 335

Leu Asp Glu Gln Glu Thr Glu Thr Lys Gln Gln Glu Ser Glu Thr Asp
340 345 350

Phe Trp Asn Asn Val Ser Arg Glu Glu Ser Lys Glu Lys Pro Ser Val
355 360 365

Phe His Leu Gln Ala Pro Thr Trp Tyr Gly Glu Pro Ser Pro Ala Ala
370 375 380

His Trp Ala Phe Gly Gly Lys Leu Val Gln Ile Thr Pro Asp Gly Lys
385 390 395 400

Gly Val Ser Ile Thr Asn Pro Lys Ile Ser Gly Leu Glu Ser Asn Thr
405 410 415

Thr Leu Ser Glu Ala Leu Lys Thr Lys Asp Phe Lys Pro Leu Ile Asn
420 425 430

Gln Arg Leu Val Lys Val Ile Asp Asp Val Asn Glu Glu Asp Trp Asn
435 440 445

Leu Leu Glu Lys Leu Ser Met Asp Gly Thr Glu Glu Phe Leu Lys Glu
450 455 460

Ala Leu Ala Phe Asp Asn Asp Glu Ser Asp Ala Gln Asp Asp Ala Asn
465 470 475 480

Asn Glu Lys Glu Asp Asp Gly Glu Glu Phe Phe Gln Gln Ile Glu Thr
485 490 495

Asn Phe Gln Pro Glu Gly Asp Phe Ser Leu Ser Gly Asn Ile Glu Gln
500 505 510

Thr Ile Ser Lys Asn Leu Val Ser Gly Asn Ile Lys Ser Ala Val Lys
515 520 525

Asn Ser Leu Glu Asn Asp Leu Leu Met Glu Ala Met Val Ile Ala Leu
530 535 540

Asp Ser Asn Asn Glu Arg Leu Lys Glu Ser Val Lys Asn Ala Tyr Phe
545 550 555 560

Ala Lys Tyr Gly Ser Lys Ser Ser Leu Ser Arg Ile Leu Tyr Ser Ile
565 570 575

Ser Lys Arg Glu Val Asp Asp Leu Val Glu Asn Leu Asp Val Ser Gln
580 585 590

Trp Lys Phe Ile Ser Lys Ala Ile Gln Asn Leu Tyr Pro Asn Asp Ile
595 600 605

Ala Gln Arg Asn Glu Met Leu Ile Lys Leu Gly Asp Arg Leu Lys Glu
610 615 620

Asn Gly His Arg Gln Asp Ser Leu Thr Leu Tyr Leu Ala Ala Gly Ser
625 630 635 640

Leu Asp Lys Val Ala Ser Ile Trp Leu Ser Glu Phe Pro Asp Leu Glu
645 650 655

Asp Lys Leu Lys Lys Asp Asn Lys Thr Ile Tyr Glu Ala His Ser Glu
660 665 670

Cys Leu Thr Glu Phe Ile Glu Arg Phe Thr Val Phe Ser Asn Phe Ile
675 680 685

Asn Gly Ser Ser Thr Ile Asn Asn Glu Gln Leu Ile Ala Lys Phe Leu
690 695 700

Glu Phe Ile Asn Leu Thr Thr Ser Thr Gly Asn Phe Glu Leu Ala Thr
705 710 715 720

Glu Phe Leu Asn Ser Leu Pro Ser Asp Asn Glu Glu Val Lys Thr Glu
725 730 735

Lys Ala Arg Val Leu Ile Ala Ser Gly Lys Ser Leu Pro Ala Gln Asn
740 745 750

Pro Ala Thr Ala Thr Thr Ser Lys Ala Lys Tyr Thr Asn Ala Lys Thr
755 760 765

Asn Lys Asn Val Pro Val Leu Pro Thr Pro Gly Met Pro Ser Thr Thr
770 775 780

Ser Ile Pro Ser Met Gln Ala Pro Phe Tyr Gly Met Thr Pro Gly Ala
785 790 795 800

Ser Ala Asn Ala Leu Pro Pro Lys Pro Tyr Val Pro Ala Thr Thr Thr
805 810 815

Ser Ala Pro Val His Thr Glu Gly Lys Tyr Ala Pro Pro Ser Gln Pro
820 825 830

Ser Met Ala Ser Pro Phe Val Asn Lys Thr Asn Ser Ser Thr Arg Leu
835 840 845

Asn Ser Phe Ala Pro Pro Pro Asn Pro Tyr Ala Thr Ala Thr Val Pro
850 855 860

Ala Thr Asn Val Ser Thr Thr Ser Ile Pro Gln Asn Thr Phe Ala Pro
865 870 875 880

Ile Gln Pro Gly Met Pro Ile Met Gly Asp Tyr Asn Ala Gln Ser Ser
885 890 895

Ser Ile Pro Ser Gln Pro Pro Ile Asn Ala Val Ser Gly Gln Thr Pro
900 905 910

His Leu Asn Arg Lys Ala Asn Asp Gly Trp Asn Asp Leu Pro Leu Lys
915 920 925

Val Lys Glu Lys Pro Ser Arg Ala Lys Ala Val Ser Val Ala Pro Pro
930 935 940

Asn Ile Leu Ser Thr Pro Thr Pro Leu Asn Gly Ile Pro Ala Asn Ala
945 950 955 960

Ala Ser Thr Met Pro Pro Pro Pro Leu Ser Arg Ala Pro Ser Ser Val
965 970 975

Ser Met Val Ser Pro Pro Pro Leu His Lys Asn Ser Arg Val Pro Ser
980 985 990

Leu Val Ala Thr Ser Glu Ser Pro Arg Ala Ser Ile Ser Asn Pro Tyr
995 1000 1005

Ala Pro Pro Gln Ser Ser Gln Gln Phe Pro Ile Gly Thr Ile Ser
1010 1015 1020

Thr Ala Asn Gln Thr Ser Asn Thr Ala Gln Val Ala Ser Ser Asn
1025 1030 1035

Pro Tyr Ala Pro Pro Pro Gln Gln Arg Val Ala Thr Pro Leu Ser
1040 1045 1050

Gly Gly Val Pro Pro Ala Pro Leu Pro Lys Ala Ser Asn Pro Tyr
1055 1060 1065

Ala Pro Thr Ala Thr Thr Gln Pro Asn Gly Ser Ser Tyr Pro Pro
1070 1075 1080

Thr Gly Pro Tyr Thr Asn Asn His Thr Met Thr Ser Pro Pro Pro
1085 1090 1095

Val Phe Asn Lys Pro Pro Thr Gly Pro Pro Pro Ile Ser Met Lys
1100 1105 1110



Lys Arg Ser Asn Lys Leu Ala Ser Ile Glu Gln Asn Pro Ser Gln
1115 1120 1125

Gly Ala Thr Tyr Pro Pro Thr Leu Ser Ser Ser Ala Ser Pro Leu
1130 1135 1140

Gln Pro Ser Gln Pro Pro Thr Leu Ala Ser Gln Val Asn Thr Ser
1145 1150 1155

Ala Glu Asn Val Ser His Glu Ile Pro Ala Asp Gln Gln Pro Ile
1160 1165 1170

Val Asp Phe Leu Lys Glu Glu Leu Ala Arg Val Thr Pro Leu Thr
1175 1180 1185

Pro Lys Glu Tyr Ser Lys Gln Leu Lys Asp Cys Asp Lys Arg Leu
1190 1195 1200

Lys Ile Leu Phe Tyr His Leu Glu Lys Gln Asp Leu Leu Thr Gln
1205 1210 1215

Pro Thr Ile Asp Cys Leu His Asp Leu Val Ala Leu Met Lys Glu
1220 1225 1230

Lys Lys Tyr Lys Glu Ala Met Val Ile His Ala Asn Ile Ala Thr
1235 1240 1245

Asn His Ala Gln Glu Gly Gly Asn Trp Leu Thr Gly Val Lys Arg
1250 1255 1260

Leu Ile Gly Ile Ala Glu Ala Thr Leu Asn
1265 1270

<210> 56

<211> 2000
<212> DNA 

<213> Saccharomyces cerevisiae 



<400> 56 



atggtcaaac ttgctgagtt ttctcgaaca gccacgtttg cgtggtcaca tgataaaatt 60
ccattattgg tctctggtac cgtatctggt acggtggatg ctaatttctc cactgattca 120
tctctagaat tgtggtcatt gttggctgct gattcggaga agcctattgc ttccttgcaa 180
gtggattcca aattcaatga tttggattgg tctcataata acaagattat tgctggtgct 240
ctggataacg gtagtttgga attgtactcc accaatgaag caaacaacgc tatcaactcc 300
atggccagat ttagcaacca ttcttcctct gtgaagacgg taaagtttaa cgcaaagcaa 360
gacaacgttc ttgcttcggg tggtaacaac ggtgaaattt ttatttggga catgaataaa 420
tgcactgaat cgccctccaa ttatactcca ttgacaccgg gtcaatcgat gtcgtccgtt 480







gacgaggtca tttccctagc atggaaccaa tctttggccc atgtttttgc atctgccggg 540

tcgtctaatt tcgcatctat ttgggatttg aaggctaaga aggaagtcat tcatctaagt 600

tacacttcac ctaattcagg tatcaagcaa cagctgtccg ttgttgaatg gcacccaaaa 660

aactccacaa gagtggcaac ggctactggt agcgataatg atccatctat cctgatctgg 720

gatttaagaa acgccaacac accattgcag actttaaatc aaggccatca aaagggtatt 780

ttgtcattag attggtgtca tcaggacgaa catctattat tgtccagtgg tagagataat 840

accgttcttc tatggaaccc tgagtcagcc gaacaactgt cccaattccc agctcgtgga 900

aactggtgtt ttaagaccaa atttgcacca gaggctccag acctatttgc ttgtgcctcc 960

tttgataaca aaattgaggt acagactttg caaaatctca caaacacttt ggatgagcaa 1020

gaaaccgaaa ctaagcagca agaatctgaa acagattttt ggaataatgt ttcccgagag 1080

gaatcaaaag agaagccatc tgttttccat ttacaagccc caacttggta tggggaacca 1140

tctcccgcag ctcattgggc tttcggtggt aaattggttc aaattactcc agatggtaaa 1200

ggtgtatcta taacaaaccc aaaaatttca ggcttagaat caaacactac tttgagtgaa 1260

gcgttgaaaa ctaaggattt caaaccatta ataaatcaaa gactggtcaa agttattgat 1320

gacgttaatg aagaagattg gaatttattg gaaaagttat caatggacgg tactgaggag 1380

ttcttgaaag aggctcttgc attcgacaac gatgaatcag atgcacaaga cgatgccaac 1440

aatgagaaag aagacgatgg ggaagaattc tttcaacaaa ttgaaaccaa tttccaaccc 1500

gagggcgatt tctccttgtc tggtaatatc gaacaaacta tttccaagaa cttggtttct 1560

ggcaacatta agagcgctgt gaaaaattct ctagagaatg acttactaat ggaggccatg 1620

gtgatcgcat tagattcaaa taacgaaaga ttaaaggaaa gtgtcaagaa tgcctatttt 1680

gcgaagtatg gatctaaatc atcgctctcg aggatactat actccatttc taagagggaa 1740

gtagatgatt tggttgaaaa tttggatgtc tctcagtgga agtttatctc taaagcaatt 1800

caaaacttat atccaaatga tatcgcccag aggaatgaaa tgttgattaa attgggagac 1860

aggttaaagg aaaatggtca tagacaagat tctttgactt tgtacttggc tgccggatca 1920

ttagataagg tggcttcaat ttggttatca gaatttccag atttggagga taaattgaag 1980

aaagataata agacaattta 2000



<210> 57
<211> 649
<212> PRT
<213> Saccharomyces cerevisiae
<400> 57

Met Ser Arg Ala Val Gly Ile Asp Leu Gly Thr Thr Tyr Ser Cys Val
1 5 10 15

Ala His Phe Ser Asn Asp Arg Val Glu Ile Ile Ala Asn Asp Gln Gly
20 25 30

Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe Thr Asp Thr Glu Arg Leu
35 40 45

Ile Gly Asp Ala Ala Lys Asn Gln Ala Ala Ile Asn Pro His Asn Thr
50 55 60

Val Phe Asp Ala Lys Arg Leu Ile Gly Arg Lys Phe Asp Asp Pro Glu
65 70 75 80

Val Thr Thr Asp Ala Lys His Phe Pro Phe Lys Val Ile Ser Arg Asp
85 90 95

Gly Lys Pro Val Val Gln Val Glu Tyr Lys Gly Glu Thr Lys Thr Phe
100 105 110

Thr Pro Glu Glu Ile Ser Ser Met Val Leu Ser Lys Met Lys Glu Thr
115 120 125

Ala Glu Asn Tyr Leu Gly Thr Thr Val Asn Asp Ala Val Val Thr Val
130 135 140

Pro Ala Tyr Phe Asn Asp Ser Gln Arg Gln Ala Thr Lys Asp Ala Gly
145 150 155 160

Thr Ile Ala Gly Met Asn Val Leu Arg Ile Ile Asn Glu Pro Thr Ala
165 170 175

Ala Ala Ile Ala Tyr Gly Leu Asp Lys Lys Gly Arg Ala Glu His Asn
180 185 190

Val Leu Ile Phe Asp Leu Gly Gly Gly Thr Phe Asp Val Ser Leu Leu
195 200 205

Ser Ile Asp Glu Gly Val Phe Glu Val Lys Ala Thr Ala Gly Asp Thr
210 215 220

His Leu Gly Gly Glu Asp Phe Asp Asn Arg Leu Val Asn His Leu Ala
225 230 235 240

Thr Glu Phe Lys Arg Lys Thr Lys Lys Asp Ile Ser Asn Asn Gln Arg
245 250 255

Ser Leu Arg Arg Leu Arg Thr Ala Ala Glu Arg Ala Lys Arg Ala Leu
260 265 270

Ser Ser Ser Ser Gln Thr Ser Ile Glu Ile Asp Ser Leu Phe Glu Gly
275 280 285

Met Asp Phe Tyr Thr Ser Leu Thr Arg Ala Arg Phe Glu Glu Leu Cys
290 295 300

Ala Asp Leu Phe Arg Ser Thr Leu Glu Pro Val Glu Lys Val Leu Lys
305 310 315 320

Asp Ser Lys Leu Asp Lys Ser Gln Ile Asp Glu Ile Val Leu Val Gly
325 330 335

Gly Ser Thr Arg Ile Pro Lys Ile Gln Lys Leu Val Ser Asp Phe Phe
340 345 350

Asn Gly Lys Glu Pro Asn Arg Ser Ile Asn Pro Asp Glu Ala Val Ala
355 360 365

Tyr Gly Ala Ala Val Gln Ala Ala Ile Leu Thr Gly Asp Gln Ser Thr
370 375 380

Lys Thr Gln Asp Leu Leu Leu Leu Asp Val Ala Pro Leu Ser Leu Gly
385 390 395 400

Ile Glu Thr Ala Gly Gly Ile Met Thr Lys Leu Ile Pro Arg Asn Ser
405 410 415

Thr Ile Pro Thr Lys Lys Ser Glu Thr Phe Ser Thr Tyr Ala Asp Asn
420 425 430

Gln Pro Gly Val Leu Ile Gln Val Phe Glu Gly Glu Arg Thr Arg Thr
435 440 445

Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu Leu Ser Gly Ile Pro Pro
450 455 460

Ala Pro Arg Gly Val Pro Gln Ile Asp Val Thr Phe Asp Ile Asp Ala
465 470 475 480

Asn Gly Ile Leu Asn Val Ser Ala Leu Glu Lys Gly Thr Gly Lys Ser
485 490 495

Asn Lys Ile Thr Ile Thr Asn Asp Lys Gly Arg Leu Ser Lys Asp Asp
500 505 510

Ile Asp Arg Met Val Ser Glu Ala Glu Lys Tyr Arg Ala Asp Asp Glu
515 520 525

Arg Glu Ala Glu Arg Val Gln Ala Lys Asn Gln Leu Glu Ser Tyr Ala
530 535 540

Phe Thr Leu Lys Asn Thr Ile Asn Glu Ala Ser Phe Lys Glu Lys Val
545 550 555 560

Gly Glu Asp Asp Ala Lys Arg Leu Glu Thr Ala Ser Gln Glu Thr Ile
565 570 575

Asp Trp Leu Asp Ala Ser Gln Ala Ala Ser Thr Asp Glu Tyr Lys Asp
580 585 590

Arg Gln Lys Glu Leu Glu Gly Ile Ala Asn Pro Ile Met Thr Lys Phe
595 600 605

Tyr Gly Ala Gly Ala Gly Ala Gly Pro Gly Ala Gly Glu Ser Gly Gly
610 615 620

Phe Pro Gly Ser Met Pro Asn Ser Gly Ala Thr Gly Gly Gly Glu Asp
625 630 635 640

Thr Gly Pro Thr Val Glu Glu Val Asp
645



<210> 58 

<211> 1950 

<212> DNA 

<213> Saccharomyces cerevisiae
<400> 58 

atgtctagag cagttggtat tgatttggga acaacttact cgtgtgttgc tcatttttcc 60 

aatgataggg tagagataat tgcaaatgat caaggtaata ggaccactcc atcgtatgtg 120 

gctttcacag acaccgaaag attaattggt gacgccgcca aaaatcaagc tgcaatcaat 180 

cctcataata cagtttttga tgcaaagcgg ttaattggtc gtaaatttga tgatcctgaa 240 

gtgacgacag atgccaagca cttccctttc aaagttatat ccagagatgg taaacctgta 300 

gtgcaagtag aatataaggg tgaaacgaaa acatttacgc ctgaggaaat ttcttccatg 360 

gttttaagca aaatgaagga aactgctgag aactatttgg gaactacggt caatgatgct 420 

gttgtaactg ttcctgcata tttcaatgat tctcaaagac aagccactaa ggatgcagga 480 

actattgcag ggatgaacgt tttacgtatt atcaatgaac ccactgcagc agcaattgct 540 

tatggcttgg ataagaaagg cagggctgag cacaatgtcc tgatttttga tttgggtggt 600 

ggtacttttg acgtctcttt actttcaatt gatgagggtg tttttgaggt taaggctacc 660

gcaggagaca ctcatttagg tggtgaagat tttgataata ggttggtgaa ccatttagcc 720

actgaattca aaaggaaaac gaaaaaggac atctctaata atcaaagatc gttaagaaga 780

ttgagaactg cggcagaaag agctaagaga gcgctttctt cctcatctca aacctcgatc 840

gagatcgatt ctttatttga aggtatggat ttctacactt cgttaacaag ggcaaggttt 900

gaagagctat gtgctgattt attcagatcc acattggaac cagtagaaaa ggttcttaaa 960

gattcgaagc tggacaagtc ccaaattgat gagattgtgt tagtcggtgg atctaccaga 1020

atcccaaaga ttcagaaatt agtttctgac ttcttcaatg gcaaagagcc taatcgttct 1080

atcaacccgg atgaggctgt tgcttatggt gcagccgttc aagctgccat tttaaccggc 1140

gatcaatcaa caaagacaca agatttacta ttattggatg ttgcgccatt gtccctagga 1200

attgaaactg caggcggcat aatgactaag ctaattccta gaaactcaac gattccaaca 1260

aagaaatcgg aaaccttctc tacctatgca gataatcaac ctggtgtttt aattcaagtc 1320

tttgaaggtg aaagaacaag aacaaaggat aataacttac ttggtaaatt cgaattaagt 1380

ggcattccgc ctgctcccag aggtgtgcct caaattgatg ttacctttga tatcgacgct 1440

aatggtattc ttaatgtgtc tgctttggaa aagggtactg gtaagagtaa caaaatcacg 1500

atcactaacg ataaaggtag gctctcgaag gatgatattg ataggatggt ttctgaagct 1560

gaaaaatata gggctgacga tgaaagggag gcagaacgag ttcaggctaa gaatcagctt 1620

gaatcgtatg catttacttt gaagaatacc ataaacgaag caagtttcaa agagaaagta 1680

ggtgaagatg atgcaaagag attagaaaca gcgtctcagg aaaccattga ctggttagat 1740

gcatcgcagg cagcctctac ggacgaatat aaggatagac aaaaggagtt ggaaggcatt 1800

gccaatccaa taatgacgaa attttacggt gctggtgccg gcgcaggtcc tggagcgggg 1860

gaatccggtg gattccccgg atccatgccc aactcgggtg ctacgggagg tggagaagat 1920

acaggtccaa cagtggaaga ggttgattga 1950



<210> 59
<211> 206
<212> PRT
<213> Saccharomyces cerevisiae
<400> 59

Met Pro Ser His Arg Asn Ser Asn Leu Lys Phe Cys Thr Val Cys Ala
1 5 10 15

Ser Asn Asn Asn Arg Ser Met Glu Ser His Lys Val Leu Gln Glu Ala
20 25 30

Gly Tyr Asn Val Ser Ser Tyr Gly Thr Gly Ser Ala Val Arg Leu Pro
35 40 45

Gly Leu Ser Ile Asp Lys Pro Asn Val Tyr Ser Phe Gly Thr Pro Tyr
50 55 60

Asn Asp Ile Tyr Asn Asp Leu Leu Ser Gln Ser Ala Asp Arg Tyr Lys
65 70 75 80

Ser Asn Gly Leu Leu Gln Met Leu Asp Arg Asn Arg Arg Leu Lys Lys
85 90 95

Ala Pro Glu Lys Trp Gln Glu Ser Thr Lys Val Phe Asp Phe Val Phe
100 105 110

Thr Cys Glu Glu Arg Cys Phe Asp Ala Val Cys Glu Asp Leu Met Asn
115 120 125

Arg Gly Gly Lys Leu Asn Lys Ile Val His Val Ile Asn Val Asp Ile
130 135 140

Lys Asp Asp Asp Glu Asn Ala Lys Ile Gly Ser Lys Ala Ile Leu Glu
145 150 155 160

Leu Ala Asp Met Leu Asn Asp Lys Ile Glu Gln Cys Glu Lys Asp Asp
165 170 175

Ile Pro Phe Glu Asp Cys Ile Met Asp Ile Leu Thr Glu Trp Gln Ser
180 185 190

Ser His Ser Gln Leu Pro Ser Leu Tyr Ala Pro Ser Tyr Tyr
195 200 205



<210> 60 
<211> 621 
<212> DNA 

<213> Saccharomyces cerevisiae
<400> 60 

atgcctagtc atcgcaattc aaacttgaag ttttgcacag tttgtgcatc aaacaacaat 60 

cgttcaatgg aatcgcataa agtcctgcaa gaagcaggct ataatgttag ctcttacgga 120 

acaggttcag ctgtgagact gcctggtcta tcgatagata agcctaatgt gtactcattt 180 

ggtacaccct ataatgatat atataatgat cttttatcac aatcagcaga ccgttacaag 240 

tcgaacggtt tattgcaaat gctggatcgt aatagaagac tcaaaaaagc acctgaaaaa 300 

tggcaagaaa gtacaaaagt cttcgacttc gttttcactt gtgaagagag atgttttgat 360 

gccgtttgtg aagatttgat gaatagaggt gggaaattaa acaaaatagt gcatgtaatt 420 



aatgttgaca ttaaagatga tgatgaaaat gctaaaattg gtagcaaagc tatattggaa 480

ttagctgata tgctcaatga taaaatagaa caatgtgaaa aagatgacat tccctttgaa 540

gattgtataa tggacatttt aactgagtgg caaagctcac attctcaact accgtcatta 600

tacgctcctt catattacta a 621

<210> 61
<211> 516
<212> PRT
<213> Saccharomyces cerevisiae
<400> 61

Met Ser Thr Gln Gln Gln Ser Tyr Thr Ile Trp Ser Pro Gln Asp Thr
1 5 10 15

Val Lys Asp Val Ala Glu Ser Leu Gly Leu Glu Asn Ile Asn Asp Asp
20 25 30

Val Leu Lys Ala Leu Ala Met Asp Val Glu Tyr Arg Ile Leu Glu Ile
35 40 45

Ile Glu Gln Ala Val Lys Phe Lys Arg His Ser Lys Arg Asp Val Leu
50 55 60

Thr Thr Asp Asp Val Ser Lys Ala Leu Arg Val Leu Asn Val Glu Pro
65 70 75 80

Leu Tyr Gly Tyr Tyr Asp Gly Ser Glu Val Asn Lys Ala Val Ser Phe
85 90 95

Ser Lys Val Asn Thr Ser Gly Gly Gln Ser Val Tyr Tyr Leu Asp Glu
100 105 110

Glu Glu Val Asp Phe Asp Arg Leu Ile Asn Glu Pro Leu Pro Gln Val
115 120 125

Pro Arg Leu Pro Thr Phe Thr Thr His Trp Leu Ala Val Glu Gly Val
130 135 140

Gln Pro Ala Ile Ile Gln Asn Pro Asn Leu Asn Asp Ile Arg Val Ser
145 150 155 160

Gln Pro Pro Phe Ile Arg Gly Ala Ile Val Thr Ala Leu Asn Asp Asn
165 170 175

Ser Leu Gln Thr Pro Val Thr Ser Thr Thr Ala Ser Ala Ser Val Thr
180 185 190

Asp Thr Gly Ala Ser Gln His Leu Ser Asn Val Lys Pro Gly Gln Asn
195 200 205

Thr Glu Val Lys Pro Leu Val Lys His Val Leu Ser Lys Glu Leu Gln
210 215 220

Ile Tyr Phe Asn Lys Val Ile Ser Thr Leu Thr Ala Lys Ser Gln Ala
225 230 235 240

Asp Glu Ala Ala Gln His Met Lys Gln Ala Ala Leu Thr Ser Leu Arg
245 250 255

Thr Asp Ser Gly Leu His Gln Leu Val Pro Tyr Phe Ile Gln Phe Ile
260 265 270

Ala Glu Gln Ile Thr Gln Asn Leu Ser Asp Leu Gln Leu Leu Thr Thr
275 280 285

Ile Leu Glu Met Ile Tyr Ser Leu Leu Ser Asn Thr Ser Ile Phe Leu
290 295 300

Asp Pro Tyr Ile His Ser Leu Met Pro Ser Ile Leu Thr Leu Leu Leu
305 310 315 320

Ala Lys Lys Leu Gly Gly Ser Pro Lys Asp Asp Ser Pro Gln Glu Ile
325 330 335

His Glu Phe Leu Glu Arg Thr Asn Ala Leu Arg Asp Phe Ala Ala Ser
340 345 350

Leu Leu Asp Tyr Val Leu Lys Lys Phe Pro Gln Ala Tyr Lys Ser Leu
355 360 365

Lys Pro Arg Val Thr Arg Thr Leu Leu Lys Thr Phe Leu Asp Ile Asn
370 375 380

Arg Val Phe Gly Thr Tyr Tyr Gly Cys Leu Lys Gly Val Ser Val Leu
385 390 395 400

Glu Gly Glu Ser Ile Arg Phe Phe Leu Gly Asn Leu Asn Asn Trp Ala
405 410 415

Arg Leu Val Phe Asn Glu Ser Gly Ile Thr Leu Asp Asn Ile Glu Glu
420 425 430

His Leu Asn Asp Asp Ser Asn Pro Thr Arg Thr Lys Phe Thr Lys Glu
435 440 445

Glu Thr Gln Ile Leu Val Asp Thr Val Ile Ser Ala Leu Leu Val Leu
450 455 460

Lys Lys Asp Leu Pro Asp Leu Tyr Glu Gly Lys Gly Glu Lys Val Thr
465 470 475 480

Asp Glu Asp Lys Glu Lys Leu Leu Glu Arg Cys Gly Val Thr Ile Gly
485 490 495

Phe His Ile Leu Lys Arg Asp Asp Ala Lys Glu Leu Ile Ser Ala Ile
500 505 510

Phe Phe Gly Glu
515

<210> 62
<211> 1551
<212> DNA
<213> Saccharomyces cerevisiae
<400> 62

atgtctacac agcagcaatc ttacacaatt tggtctccac aagatactgt taaagatgtc 60

gctgagtcgc ttgggttgga aaatatcaac gatgacgtgt tgaaagcact tgctatggac 120

gttgaatacc gtattctaga gattattgaa caggcagtca agtttaaaag acactctaaa 180

agagatgttt taactacaga cgatgtatcg aaggccctgc gtgttctgaa tgtggaacct 240

ttatatggat attatgatgg ctctgaagtt aataaagctg tatcatttag taaggttaac 300

acaagtggag gacagtcggt ttactacctg gatgaggaag aagtagattt tgatagatta 360

ataaatgagc cattaccgca agtgcctcgt ctaccaactt ttactacaca ttggctagca 420

gttgaagggg ttcaacctgc cataatccaa aatccaaatt tgaatgatat aagggtatcc 480

caaccaccat ttattagagg tgctattgtc actgctttga atgataacag cctccaaaca 540

cctgtaacgt cgacgacagc aagtgcttct gtgacggata caggcgcctc tcaacacctg 600

tctaatgtta aaccaggaca gaatactgaa gtcaaacctt tagtgaaaca cgttttatcc 660

aaagaattac agatttattt caataaggtc atttcaacct tgacagcgaa gagtcaagct 720

gatgaagctg cacagcatat gaaacaagca gctttgactt cactacgaac tgatagtggc 780

ctgcaccaac tggttccata ttttattcaa tttattgccg aacaaatcac acaaaatctt 840

tccgatttac aattgctaac aacaattttg gaaatgattt actctttact aagcaatact 900

tctattttcc tggaccctta tattcattcg ttaatgcctt ccattttaac cttgctatta 960

gcaaaaaaac ttggtggatc accaaaagat gattcgccgc aagaaattca tgaattttta 1020

gaaagaacga atgcactacg tgattttgcc gcatctttgt tggactatgt attgaaaaaa 1080

tttcctcaag catacaaatc cctgaagcca agagtaacta ggacactgct aaagacattt 1140

ttggacataa accgtgtttt tgggacttac tatgggtgct taaagggtgt atcggtacta 1200

gaaggtgaat ccatcagatt cttcttagga aacctaaata attgggcgcg cttggtattc 1260

aatgaaagcg gaataactct agacaatata gaggaacatt tgaacgatga ctccaatcca 1320

actaggacca aatttactaa agaagaaact caaatcttgg ttgatactgt aattagtgca 1380

ttgttagttc taaaaaagga tttaccagat ttgtacgagg gtaaaggtga aaaagttacg 1440

gatgaggata aggaaaaatt gctagagagg tgtggtgtca ctattggatt tcacattttg 1500

aaaagagacg acgcaaaaga attaattagt gcgatatttt ttggcgaata g 1551



<210> 63 

<211> 914

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 63 

Met Thr Asp Gln Arg Gly Pro Pro Pro Pro His Pro Gln Gln Ala Asn
1 5 10 15

Gly Tyr Lys Lys Phe Pro Pro His Asp Asn Gln Tyr Ser Gly Ala Asn
20 25 30

Asn Ser Gln Pro Asn Asn His Tyr Asn Glu Asn Leu Tyr Ser Ala Arg
35 40 45

Glu Pro His Asn Asn Lys Gln Tyr Gln Ser Lys Asn Gly Lys Tyr Gly
50 55 60

Thr Asn Lys Tyr Asn Asn Arg Asn Asn Ser Gln Gly Asn Ala Gln Tyr
65 70 75 80

Tyr Asn Asn Arg Phe Asn Asn Gly Tyr Arg Leu Asn Asn Asn Asp Tyr
85 90 95

Asn Pro Ala Met Leu Pro Gly Met Gln Trp Pro Ala Asn Tyr Tyr Ala
100 105 110

Pro Gln Met Tyr Tyr Ile Pro Gln Gln Met Val Pro Val Ala Ser Pro
115 120 125

Pro Tyr Thr His Gln Pro Leu Asn Thr Asn Pro Glu Pro Pro Ser Thr
130 135 140

Pro Lys Thr Thr Lys Ile Glu Ile Thr Thr Lys Thr Gly Glu Arg Leu
145 150 155 160

Asn Leu Lys Lys Phe His Glu Glu Lys Lys Ala Ser Lys Gly Glu Glu
165 170 175

Lys Asn Asp Gly Val Glu Gln Lys Ser Lys Ser Gly Thr Pro Phe Glu
180 185 190

Lys Glu Ala Thr Pro Val Leu Pro Ala Asn Glu Ala Val Lys Asp Thr
195 200 205

Leu Thr Glu Thr Ser Asn Glu Lys Ser Thr Ser Glu Ala Glu Asn Thr
210 215 220

Lys Arg Leu Phe Leu Glu Gln Val Arg Leu Arg Lys Ala Ala Met Glu
225 230 235 240

Arg Lys Lys Asn Gly Leu Ile Ser Glu Thr Glu Lys Lys Gln Glu Thr
245 250 255

Ser Asn His Asp Asn Thr Asp Thr Thr Lys Pro Asn Ser Val Ile Glu
260 265 270

Ser Glu Pro Ile Lys Glu Ala Pro Lys Pro Thr Gly Glu Ala Asn Glu
275 280 285

Val Val Ile Asp Gly Lys Ser Gly Ala Ser Val Lys Thr Pro Gln His
290 295 300

Val Thr Gly Ser Val Thr Lys Ser Val Thr Phe Asn Glu Pro Glu Asn
305 310 315 320

Glu Ser Ser Ser Gln Asp Val Asp Glu Leu Val Lys Asp Asp Asp Thr
325 330 335

Thr Glu Ile Ser Asp Thr Thr Gly Gly Lys Thr Val Asn Lys Ser Asp
340 345 350

Asp Glu Thr Ile Asn Ser Val Ile Thr Thr Glu Glu Asn Thr Val Lys
355 360 365

Glu Thr Glu Pro Ser Thr Ser Asp Ile Glu Met Pro Thr Val Ser Gln
370 375 380

Leu Leu Glu Thr Leu Gly Lys Ala Gln Pro Ile Ser Asp Ile Tyr Glu
385 390 395 400

Phe Ala Tyr Pro Glu Asn Val Glu Arg Pro Asp Ile Lys Tyr Lys Lys
405 410 415

Pro Ser Val Lys Tyr Thr Tyr Gly Pro Thr Phe Leu Leu Gln Phe Lys
420 425 430

Asp Lys Leu Lys Phe Arg Pro Asp Pro Ala Trp Val Glu Ala Val Ser
435 440 445

Ser Lys Ile Val Ile Pro Pro His Ile Ala Arg Asn Lys Pro Lys Asp
450 455 460

Ser Gly Arg Phe Gly Gly Asp Phe Arg Ser Pro Ser Met Arg Gly Met
465 470 475 480

Asp His Thr Ser Ser Ser Arg Val Ser Ser Lys Arg Arg Ser Lys Arg
485 490 495

Met Gly Asp Asp Arg Arg Ser Asn Arg Gly Tyr Thr Ser Arg Lys Asp
500 505 510

Arg Glu Lys Ala Ala Glu Lys Ala Glu Glu Gln Ala Pro Lys Glu Glu
515 520 525

Ile Ala Pro Leu Val Pro Ser Ala Asn Arg Trp Ile Pro Lys Ser Arg
530 535 540

Val Lys Lys Thr Glu Lys Lys Leu Ala Pro Asp Gly Lys Thr Glu Leu
545 550 555 560

Phe Asp Lys Glu Glu Val Glu Arg Lys Met Lys Ser Leu Leu Asn Lys
565 570 575

Leu Thr Leu Glu Met Phe Asp Ser Ile Ser Ser Glu Ile Leu Asp Ile
580 585 590

Ala Asn Gln Ser Lys Trp Glu Asp Asp Gly Glu Thr Leu Lys Ile Val
595 600 605

Ile Glu Gln Ile Phe His Lys Ala Cys Asp Glu Pro His Trp Ser Ser
610 615 620

Met Tyr Ala Gln Leu Cys Gly Lys Val Val Lys Asp Leu Asp Pro Asn
625 630 635 640

Ile Lys Asp Lys Glu Asn Glu Gly Lys Asn Gly Pro Lys Leu Val Leu
645 650 655

His Tyr Leu Val Ala Arg Cys His Glu Glu Phe Glu Lys Gly Trp Ala
660 665 670

Asp Lys Leu Pro Ala Gly Glu Asp Gly Asn Pro Leu Glu Pro Glu Met
675 680 685

Met Ser Asp Glu Tyr Tyr Ile Ala Ala Ala Ala Lys Arg Arg Gly Leu
690 695 700

Gly Leu Val Arg Phe Ile Gly Tyr Leu Tyr Cys Leu Asn Leu Leu Thr
705 710 715 720

Gly Lys Met Met Phe Glu Cys Phe Arg Arg Leu Met Lys Asp Leu Asn
725 730 735

Asn Asp Pro Ser Glu Glu Thr Leu Glu Ser Val Ile Glu Leu Leu Asn
740 745 750

Thr Val Gly Glu Gln Phe Glu His Asp Lys Phe Val Thr Pro Gln Ala
755 760 765

Thr Leu Glu Gly Ser Val Leu Leu Asp Asn Leu Phe Met Leu Leu Gln
770 775 780

His Ile Ile Asp Gly Gly Thr Ile Ser Asn Arg Ile Lys Phe Lys Leu
785 790 795 800

Ile Asp Val Lys Glu Leu Arg Glu Ile Lys His Trp Asn Ser Ala Lys
805 810 815

Lys Asp Ala Gly Pro Lys Thr Ile Gln Gln Ile His Gln Glu Glu Glu
820 825 830

Gln Leu Arg Gln Lys Lys Asn Ser Gln Arg Ser Asn Ser Arg Phe Asn
835 840 845

Asn His Asn Gln Ser Asn Ser Asn Arg Tyr Ser Ser Asn Arg Arg Asn
850 855 860

Met Gln Asn Thr Gln Arg Asp Ser Phe Ala Ser Thr Lys Thr Gly Ser
865 870 875 880

Phe Arg Asn Asn Gln Arg Asn Ala Arg Lys Val Glu Glu Val Ser Gln
885 890 895

Ala Pro Arg Ala Asn Met Phe Asp Ala Leu Met Asn Asn Asp Gly Asp
900 905 910

Ser Asp



<210> 64 

<211> 2000
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 64 



atgactgacc aaagaggtcc accgccccca cacccgcagc aagccaatgg ctacaagaaa 60

tttcctcctc atgataacca atactctgga gccaataata gtcagccaaa taaccactac 120

aatgaaaatc tttacagtgc aagggaacct cacaataaca agcaatacca gtcgaaaaat 180

ggcaaatacg ggacaaataa atacaataat cgtaataata gtcaaggaaa tgcacagtac 240

tacaataaca gattcaataa tggctacaga ctgaataaca acgactataa cccggcaatg 300

ttgccaggta tgcaatggcc agctaactac tatgctcctc agatgtatta tataccccaa 360

caaatggttc cagttgcttc tccaccatat acacaccaac cactcaacac taatcccgaa 420

cctccctcta cacctaaaac tacaaaaata gaaatcacta cgaaaactgg tgaacgttta 480

aatttgaaaa aattccacga agaaaagaaa gcttcgaaag gtgaagagaa gaatgatggg 540

gtagagcaga aatctaaatc aggaacccct tttgaaaagg aggcaactcc tgtactacct 600

gctaatgaag cagttaaaga tacgttaacc gaaacatcta atgaaaaatc tacctctgaa 660

gcggaaaata ctaagagatt atttttagag caggtgagat tgcgcaaagc tgccatggaa 720

agaaagaaaa acggtttaat ctccgaaact gagaagaagc aagaaacttc aaatcatgat 780

aatactgata caactaagcc caactcggtt attgaatctg aacctattaa ggaagcgccg 840

aaaccaacgg gagaagctaa tgaagttgtg attgatggaa aatcaggggc aagtgtcaaa 900

actccacaac atgtaactgg aagtgtaact aaatccgtga cttttaacga acccgagaat 960

gagagcagtt ctcaggatgt tgatgagctt gttaaagatg atgatactac tgaaatttcg 1020

gatacgactg gtggaaaaac tgtaaataaa agtgacgacg aaacaataaa ttccgtaatc 1080

accacagagg agaacacagt gaaggagaca gaaccttcca cttccgatat tgagatgcca 1140

actgtatctc aactacttga aactttaggt aaggctcaac caatttctga catatatgag 1200

tttgcctatc cggaaaatgt tgaaaggcct gatatcaaat acaagaaacc aagcgttaag 1260

tacacttatg gccctacctt cttactgcaa ttcaaagata aactaaaatt caggcctgat 1320

cctgcgtggg ttgaagctgt atcctcgaaa attgttatac ctcctcatat agccagaaat 1380

aaaccaaaag atagtggcag atttggaggc gatttcagaa gtccatctat gcgcggtatg 1440

gaccatactt ccagttcaag agtttcatcg aaaagaagat caaagagaat gggtgacgac 1500

agaagatcta atagaggata tacctccaga aaagatcgcg aaaaagctgc agaaaaggca 1560

gaagagcaag ctccaaagga agaaatcgcc ccgttggttc cgagtgctaa tagatggata 1620

cctaaatcaa gggttaaaaa aacagaaaag aagttagctc ctgacggcaa aactgaatta 1680

tttgacaagg aagaagtaga acgaaaaatg aagtctctat taaacaagtt aactttagaa 1740

atgtttgatt caatctcctc tgagattttg gatatagcaa accaatcaaa gtgggaagat 1800

gatggcgaaa cattgaaaat agttattgaa caaattttcc acaaggcttg tgatgaacct 1860

cattggtctt caatgtatgc tcagttatgt ggcaaagtgg tcaaagacct tgacccaaac 1920

attaaagata aagagaacga aggaaagaat ggaccaaagc ttgtgttgca ttatttagtg 1980

gcaagatgtc acgaagaatt 2000

<210> 65
<211> 680
<212> PRT
<213> Saccharomyces cerevisiae
<400> 65

Met Thr Gln Phe Thr Asp Ile Asp Lys Leu Ala Val Ser Thr Ile Arg
1 5 10 15

Ile Leu Ala Val Asp Thr Val Ser Lys Ala Asn Ser Gly His Pro Gly
20 25 30

Ala Pro Leu Gly Met Ala Pro Ala Ala His Val Leu Trp Ser Gln Met
35 40 45

Arg Met Asn Pro Thr Asn Pro Asp Trp Ile Asn Arg Asp Arg Phe Val
50 55 60

Leu Ser Asn Gly His Ala Val Ala Leu Leu Tyr Ser Met Leu His Leu
65 70 75 80

Thr Gly Tyr Asp Leu Ser Ile Glu Asp Leu Lys Gln Phe Arg Gln Leu
85 90 95

Gly Ser Arg Thr Pro Gly His Pro Glu Phe Glu Leu Pro Gly Val Glu
100 105 110

Val Thr Thr Gly Pro Leu Gly Gln Gly Ile Ser Asn Ala Val Gly Met
115 120 125

Ala Met Ala Gln Ala Asn Leu Ala Ala Thr Tyr Asn Lys Pro Gly Phe
130 135 140

Thr Leu Ser Asp Asn Tyr Thr Tyr Val Phe Leu Gly Asp Gly Cys Leu
145 150 155 160

Gln Glu Gly Ile Ser Ser Glu Ala Ser Ser Leu Ala Gly His Leu Lys
165 170 175

Leu Gly Asn Leu Ile Ala Ile Tyr Asp Asp Asn Lys Ile Thr Ile Asp
180 185 190

Gly Ala Thr Ser Ile Ser Phe Asp Glu Asp Val Ala Lys Arg Tyr Glu
195 200 205

Ala Tyr Gly Trp Glu Val Leu Tyr Val Glu Asn Gly Asn Glu Asp Leu
210 215 220

Ala Gly Ile Ala Lys Ala Ile Ala Gln Ala Lys Leu Ser Lys Asp Lys
225 230 235 240

Pro Thr Leu Ile Lys Met Thr Thr Thr Ile Gly Tyr Gly Ser Leu His
245 250 255

Ala Gly Ser His Ser Val His Gly Ala Pro Leu Lys Ala Asp Asp Val
260 265 270

Lys Gln Leu Lys Ser Lys Phe Gly Phe Asn Pro Asp Lys Ser Phe Val
275 280 285

Val Pro Gln Glu Val Tyr Asp His Tyr Gln Lys Thr Ile Leu Lys Pro
290 295 300

Gly Val Glu Ala Asn Asn Lys Trp Asn Lys Leu Phe Ser Glu Tyr Gln
305 310 315 320

Lys Lys Phe Pro Glu Leu Gly Ala Glu Leu Ala Arg Arg Leu Ser Gly
325 330 335

Gln Leu Pro Ala Asn Trp Glu Ser Lys Leu Pro Thr Tyr Thr Ala Lys
340 345 350

Asp Ser Ala Val Ala Thr Arg Lys Leu Ser Glu Thr Val Leu Glu Asp
355 360 365

Val Tyr Asn Gln Leu Pro Glu Leu Ile Gly Gly Ser Ala Asp Leu Thr
370 375 380

Pro Ser Asn Leu Thr Arg Trp Lys Glu Ala Leu Asp Phe Gln Pro Pro
385 390 395 400

Ser Ser Gly Ser Gly Asn Tyr Ser Gly Arg Tyr Ile Arg Tyr Gly Ile
405 410 415

Arg Glu His Ala Met Gly Ala Ile Met Asn Gly Ile Ser Ala Phe Gly
420 425 430

Ala Asn Tyr Lys Pro Tyr Gly Gly Thr Phe Leu Asn Phe Val Ser Tyr
435 440 445

Ala Ala Gly Ala Val Arg Leu Ser Ala Leu Ser Gly His Pro Val Ile
450 455 460

Trp Val Ala Thr His Asp Ser Ile Gly Val Gly Glu Asp Gly Pro Thr
465 470 475 480

His Gln Pro Ile Glu Thr Leu Ala His Phe Arg Ser Leu Pro Asn Ile
485 490 495

Gln Val Trp Arg Pro Ala Asp Gly Asn Glu Val Ser Ala Ala Tyr Lys
500 505 510

Asn Ser Leu Glu Ser Lys His Thr Pro Ser Ile Ile Ala Leu Ser Arg
515 520 525

Gln Asn Leu Pro Gln Leu Glu Gly Ser Ser Ile Glu Ser Ala Ser Lys
530 535 540

Gly Gly Tyr Val Leu Gln Asp Val Ala Asn Pro Asp Ile Ile Leu Val
545 550 555 560

Ala Thr Gly Ser Glu Val Ser Leu Ser Val Glu Ala Ala Lys Thr Leu
565 570 575

Ala Ala Lys Asn Ile Lys Ala Arg Val Val Ser Leu Pro Asp Phe Phe
580 585 590

Thr Phe Asp Lys Gln Pro Leu Glu Tyr Arg Leu Ser Val Leu Pro Asp
595 600 605

Asn Val Pro Ile Met Ser Val Glu Val Leu Ala Thr Thr Cys Trp Gly
610 615 620

Lys Tyr Ala His Gln Ser Phe Gly Ile Asp Arg Phe Gly Ala Ser Gly
625 630 635 640

Lys Ala Pro Glu Val Phe Lys Phe Phe Gly Phe Thr Pro Glu Gly Val
645 650 655

Ala Glu Arg Ala Gln Lys Thr Ile Ala Phe Tyr Lys Gly Asp Lys Leu
660 665 670

Ile Ser Pro Leu Lys Lys Ala Phe
675 680

<210> 66
<211> 2000
<212> DNA
<213> Saccharomyces cerevisiae
<400> 66



atgactcaat tcactgacat tgataagcta gccgtctcca ccataagaat tttggctgtg 60

gacaccgtat ccaaggccaa ctcaggtcac ccaggtgctc cattgggtat ggcaccagct 120

gcacacgttc tatggagtca aatgcgcatg aacccaacca acccagactg gatcaacaga 180

gatagatttg tcttgtctaa cggtcacgcg gtcgctttgt tgtattctat gctacatttg 240

actggttacg atctgtctat tgaagacttg aaacagttca gacagttggg ttccagaaca 300

ccaggtcatc ctgaatttga gttgccaggt gttgaagtta ctaccggtcc attaggtcaa 360

ggtatctcca acgctgttgg tatggccatg gctcaagcta acctggctgc cacttacaac 420

aagccgggct ttaccttgtc tgacaactac acctatgttt tcttgggtga cggttgtttg 480

caagaaggta tttcttcaga agcttcctcc ttggctggtc atttgaaatt gggtaacttg 540

attgccatct acgatgacaa caagatcact atcgatggtg ctaccagtat ctcattcgat 600

gaagatgttg ctaagagata cgaagcctac ggttgggaag ttttgtacgt agaaaatggt 660

aacgaagatc tagccggtat tgccaaggct attgctcaag ctaagttatc caaggacaaa 720

ccaactttga tcaaaatgac cacaaccatt ggttacggtt ccttgcatgc cggctctcac 780

tctgtgcacg gtgccccatt gaaagcagat gatgttaaac aactaaagag caaattcggt 840

ttcaacccag acaagtcctt tgttgttcca caagaagttt acgaccacta ccaaaagaca 900

attttaaagc caggtgtcga agccaacaac aagtggaaca agttgttcag cgaataccaa 960

aagaaattcc cagaattagg tgctgaattg gctagaagat tgagcggcca actacccgca 1020

aattgggaat ctaagttgcc aacttacacc gccaaggact ctgccgtggc cactagaaaa 1080

ttatcagaaa ctgttcttga ggatgtttac aatcaattgc cagagttgat tggtggttct 1140

gccgatttaa caccttctaa cttgaccaga tggaaggaag cccttgactt ccaacctcct 1200

tcttccggtt caggtaacta ctctggtaga tacattaggt acggtattag agaacacgct 1260

atgggtgcca taatgaacgg tatttcagct ttcggtgcca actacaaacc atacggtggt 1320

actttcttga acttcgtttc ttatgctgct ggtgccgtta gattgtccgc tttgtctggc 1380

cacccagtta tttgggttgc tacacatgac tctatcggtg tcggtgaaga tggtccaaca 1440

catcaaccta ttgaaacttt agcacacttc agatccctac caaacattca agtttggaga 1500

ccagctgatg gtaacgaagt ttctgccgcc tacaagaact ctttagaatc caagcatact 1560

ccaagtatca ttgctttgtc cagacaaaac ttgccacaat tggaaggtag ctctattgaa 1620

agcgcttcta agggtggtta cgtactacaa gatgttgcta acccagatat tattttagtg 1680

gctactggtt ccgaagtgtc tttgagtgtt gaagctgcta agactttggc cgcaaagaac 1740

atcaaggctc gtgttgtttc tctaccagat ttcttcactt ttgacaaaca acccctagaa 1800

tacagactat cagtcttacc agacaacgtt ccaatcatgt ctgttgaagt tttggctacc 1860

acatgttggg gcaaatacgc tcatcaatcc ttcggtattg acagatttgg tgcctccggt 1920

aaggcaccag aagtcttcaa gttcttcggt ttcaccccag aaggtgttgc tgaaagagct 1980

caaaagacca ttgcattcta 2000



<210> 67
<211> 196
<212> PRT
<213> Saccharomyces cerevisiae
<400> 67

Met Val Ala Gln Val Gln Lys Gln Ala Pro Thr Phe Lys Lys Thr Ala
1 5 10 15

Val Val Asp Gly Val Phe Asp Gln Val Ser Leu Asp Lys Tyr Lys Gly
20 25 30

Lys Tyr Val Val Leu Ala Phe Ile Pro Leu Ala Phe Thr Phe Val Cys
35 40 45

Pro Thr Glu Ile Ile Ala Phe Ser Glu Ala Ala Lys Lys Phe Glu Glu
50 55 60

Gln Gly Ala Gln Val Leu Phe Ala Ser Thr Asp Ser Glu Tyr Ser Leu
65 70 75 80

Leu Ala Trp Thr Asn Ile Pro Arg Lys Glu Gly Gly Leu Gly Pro Ile
85 90 95

Asn Ile Pro Leu Leu Ala Asp Thr Asn His Ser Leu Ser Arg Asp Tyr
100 105 110

Gly Val Leu Ile Glu Glu Glu Gly Val Ala Leu Arg Gly Leu Phe Ile
115 120 125

Ile Asp Pro Lys Gly Val Ile Arg His Ile Thr Ile Asn Asp Leu Pro
130 135 140

Val Gly Arg Asn Val Asp Glu Ala Leu Arg Leu Val Glu Ala Phe Gln
145 150 155 160

Trp Thr Asp Lys Asn Gly Thr Val Leu Pro Cys Asn Trp Thr Pro Gly
165 170 175

Ala Ala Thr Ile Lys Pro Thr Val Glu Asp Ser Lys Glu Tyr Phe Glu
180 185 190

Ala Ala Asn Lys
195



< 210 > 68 
<211> 591 

<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 68 

atggtcgctc aagttcaaaa gcaagctcca acttttaaga aaactgccgt cgtcgacggt 60 

gtctttgacg aagtctcctt ggacaaatac aagggtaagt acgttgtcct agcctttatt 120 

ccattggcct tcactttcgt ctgtccaacc gaaatcattg ctttctcaga agctgctaag 180 

aaattcgaag aacaaggcgc tcaagttctt ttcgcctcca ctgactccga atactccctt 240 

ttggcatgga ccaatatccc aagaaaggaa ggtggtttgg gcccaatcaa cattccattg 300 

ttggctgaca ccaaccactc tttgtccaga gactatggtg tcttgatcga agaagaaggt 360 

gtcgccttga gaggtttgtt catcatcgac ccaaagggtg tcattagaca catcaccatt 420 

aacgatttgc cagtcggtag aaacgttgac gaagccttga gattggttga agccttccaa 480 

tggaccgaca agaacggtac tgtcttgcca tgtaactgga ctccaggtgc tgctaccatc 540 

aagccaaccg ttgaagactc caaggaatac ttcgaagctg ccaacaaata a 591



<210> 69 

<211> 291 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 69 

Met Asn Ser Ile Leu Asp Arg Asn Val Arg Ser Ser Glu Thr Thr Leu
1 5 10 15

Ile Lys Pro Glu Ser Glu Phe Asp Asn Trp Leu Ser Asp Glu Asn Asp
20 25 30

Gly Ala Ser His Ile Asn Val Asn Lys Asp Ser Ser Ser Val Leu Ser
35 40 45

Ala Ser Ser Ser Thr Trp Phe Glu Pro Leu Glu Asn Ile Ile Ser Ser
50 55 60

Ala Ser Ser Ser Ser Ile Gly Ser Pro Ile Glu Asp Gln Phe Ile Ser
65 70 75 80

Ser Asn Asn Glu Glu Ser Ala Leu Phe Pro Thr Asp Gln Phe Phe Ser
85 90 95

Asn Pro Ser Ser Tyr Ser His Ser Pro Glu Val Ser Ser Ser Ile Lys
100 105 110

Arg Glu Glu Asp Asp Asn Ala Leu Ser Leu Ala Asp Phe Glu Pro Ala
115 120 125

Ser Leu Gln Leu Met Pro Asn Met Ile Asn Thr Asp Asn Asn Asp Asp
130 135 140

Ser Thr Pro Leu Lys Asn Glu Ile Glu Leu Asn Asp Ser Phe Ile Lys
145 150 155 160

Thr Asn Leu Asp Ala Lys Glu Thr Lys Lys Arg Ala Pro Arg Lys Arg
165 170 175

Leu Thr Pro Phe Gln Lys Gln Ala His Asn Lys Ile Glu Lys Arg Tyr
180 185 190

Arg Ile Asn Ile Asn Thr Lys Ile Ala Arg Leu Gln Gln Ile Ile Pro
195 200 205

Trp Val Ala Ser Glu Gln Thr Ala Phe Glu Val Gly Asp Ser Val Lys
210 215 220

Lys Gln Asp Glu Asp Gly Ala Glu Thr Ala Ala Thr Thr Pro Leu Pro
225 230 235 240

Ser Ala Ala Ala Thr Ser Thr Lys Leu Asn Lys Ser Met Ile Leu Glu
245 250 255

Lys Ala Val Asp Tyr Ile Leu Tyr Leu Gln Asn Asn Glu Arg Leu Tyr
260 265 270

Glu Met Glu Val Gln Arg Leu Lys Ser Glu Ile Asp Thr Leu Lys Gln
275 280 285

Asp Gln Lys
290



<210> 70

<211> 876

<212> DNA

<213> Saccharomyces cerevisiae
<400> 70



atgaactcta ttttagacag aaatgttaga tctagcgaaa ctactttaat taaacctgaa 60

tctgaatttg ataattggtt gtcggatgaa aatgacggag ctagtcatat caacgtcaac 120

aaggactcct cgtcagttct ttctgcatct tcttccacat ggttcgaacc attggaaaac 180

attatctcct ctgcatccag ctcctcgata ggctctccaa tcgaagacca gtttatatct 240

tccaacaacg aggaatctgc tctttttcca acagatcagt ttttcagtaa tccttcctca 300

tactcgcatt ctcccgaggt tagcagctcg ataaaaagag aagaggatga caatgccctt 360

tcgttggcag attttgaacc ggcttctttg caattaatgc ctaacatgat aaatactgat 420

aataatgacg atagtacccc acttaagaat gaaatcgagc taaacgactc gtttataaaa 480

acaaatctag atgctaagga aacgaaaaag agggctccaa gaaaaagact gacccccttc 540

caaaagcaag ctcacaacaa gattgaaaaa cgctacagaa taaacatcaa cacaaagatt 600

gcaagactgc agcagattat cccatgggta gcaagtgaac aaacagcttt cgaagtaggt 660

gattctgtaa aaaaacagga cgaagacggc gcagaaactg ccgctactac tcctcttcca 720

tctgccgctg ctacaagcac gaagctaaat aaaagcatga tcctagaaaa agctgttgac 780

tatattctat atctacaaaa taacgaacga ctatacgaaa tggaagttca aaggttgaaa 840

agtgaaatcg acactttgaa acaagaccaa aaataa 876



<210> 71

<211> 362

<212> PRT

<213> Saccharomyces cerevisiae
<400> 71

Met Ile Asn Asn Pro Lys Val Asp Ser Val Ala Glu Lys Pro Lys Ala
1 5 10 15

Val Thr Ser Lys Gln Ser Glu Gln Ala Ala Ser Pro Glu Pro Thr Pro
20 25 30

Ala Pro Pro Val Ser Arg Asn Gln Tyr Pro Ile Thr Phe Asn Leu Thr
35 40 45

Ser Thr Ala Pro Phe His Leu His Asp Arg His Arg Tyr Leu Gln Glu
50 55 60

Gln Asp Leu Tyr Lys Cys Ala Ser Arg Asp Ser Leu Ser Ser Leu Gln
65 70 75 80

Gln Leu Ala His Thr Pro Asn Gly Ser Thr Arg Lys Lys Tyr Ile Val
85 90 95

Glu Asp Gln Ser Pro Tyr Ser Ser Glu Asn Pro Val Ile Val Thr Ser
100 105 110

Ser Tyr Asn His Thr Val Cys Thr Asn Tyr Leu Arg Pro Arg Met Gln
115 120 125

Phe Thr Gly Tyr Gln Ile Ser Gly Tyr Lys Arg Tyr Gln Val Thr Val
130 135 140

Asn Leu Lys Thr Val Asp Leu Pro Lys Lys Asp Cys Thr Ser Leu Ser
145 150 155 160

Pro His Leu Ser Gly Phe Leu Ser Ile Arg Gly Leu Thr Asn Gln His
165 170 175

Pro Glu Ile Ser Thr Tyr Phe Glu Ala Tyr Ala Val Asn His Lys Glu
180 185 190

Leu Gly Phe Leu Ser Ser Ser Trp Lys Asp Glu Pro Val Leu Asn Glu
195 200 205

Phe Lys Ala Thr Asp Gln Thr Asp Leu Glu His Trp Ile Asn Phe Pro
210 215 220

Ser Phe Arg Gln Leu Phe Leu Met Ser Gln Lys Asn Gly Leu Asn Ser
225 230 235 240

Thr Asp Asp Asn Gly Thr Thr Asn Ala Ala Lys Lys Leu Pro Pro Gln
245 250 255

Gln Leu Pro Thr Thr Pro Ser Ala Asp Ala Gly Asn Ile Ser Arg Ile
260 265 270

Phe Ser Gln Glu Lys Gln Phe Asp Asn Tyr Leu Asn Glu Arg Phe Ile
275 280 285

Phe Met Lys Trp Lys Glu Lys Phe Leu Val Pro Asp Ala Leu Leu Met
290 295 300

Glu Gly Val Asp Gly Ala Ser Tyr Asp Gly Phe Tyr Tyr Ile Val His
305 310 315 320

Asp Gln Val Thr Gly Asn Ile Gln Gly Phe Tyr Tyr His Gln Asp Ala
325 330 335

Glu Lys Phe Gln Gln Leu Glu Leu Val Pro Ser Leu Lys Asn Lys Val
340 345 350

Glu Ser Ser Asp Cys Ser Phe Glu Phe Ala
355 360

<210> 72

<211> 1089

<212> DNA

<213> Saccharomyces cerevisiae
<400> 72



atgatcaata atcctaaggt agacagtgta gcggagaaac ccaaagctgt gacatcaaag 60

cagtcggagc aagcggcttc gccagaacca acaccagccc cccctgtttc tagaaatcag 120

tatccgatca cgttcaactt gacttcaacc gcaccctttc atcttcatga ccgtcatcgc 180

tacttacaag agcaagatct ttacaagtgc gcttctaggg attcgttgtc ttccctgcag 240

caactcgccc atacacctaa tgggtcgaca aggaagaaat atattgttga ggaccaatct 300

ccctatagtt cggaaaatcc agtcattgtg acctcttcct ataaccatac ggtttgcaca 360

aactacttaa gaccaagaat gcagtttaca gggtaccaga tatcaggata caaacgctat 420

caggtaacag ttaacttaaa gactgtagat ttgccaaaga aagattgcac gtcgctgtcc 480

cctcatttat ctggattttt gtcgataaga ggactcacga accaacaccc ggaaatcagt 540

acatattttg aagcctacgc ggtaaaccac aaggaattag ggttcttgtc ctctagctgg 600

aaagatgaac ctgttttaaa cgaattcaaa gccacagacc aaacagactt agaacactgg 660

ataaatttcc cctcctttag acagcttttc ctgatgagcc aaaaaaacgg tctcaactca 720

actgacgaca atggcactac aaatgcagcc aagaagttgc ctccacagca gcttcctact 780

acaccaagcg cagacgctgg taacatatca agaattttta gccaagagaa acaatttgac 840

aactacttga acgaacggtt tatatttatg aaatggaagg aaaaattttt ggtaccagat 900

gccctattaa tggaaggtgt agacggcgca tcttatgatg ggttttatta cattgtccat 960

gatcaagtta ccgggaacat tcaagggttt tactatcatc aagatgctga aaagttccaa 1020

cagctggaat tagtaccatc tttgaaaaat aaagtcgagt ccagtgattg ttcttttgag 1080

tttgcttga 1089

<210> 73

<211> 822
<212> PRT

<213> Saccharomyces cerevisiae
<400> 73

Met Leu Glu Gly Thr Val Asp Tyr Asp Pro Leu Glu Asp Ile Thr Asn
1 5 10 15

Ile Leu Phe Ser Lys Glu Ser Leu Asn Asn Ile Asp Glu Leu Ile Ser
20 25 30

Ile Thr Arg Ser Tyr Lys Lys Gln Leu Gln Glu Asp Ile Leu Lys Glu
35 40 45

Glu Asn Glu Leu Lys Glu His Pro Lys Asn Ser Ala Glu Ile Glu Ala
50 55 60

Ser Leu Arg Lys Val Phe Gln Asp Phe Lys Glu Thr Gln Asp Val Ser
65 70 75 80

Ala Ser Thr Glu Leu Thr Ile Ser Asn Leu Thr Glu Gly Ile Ser Tyr
85 90 95

Leu Asp Ile Ala Lys Lys Asn Leu Thr His Ser Leu Thr Leu Phe Gln
100 105 110

Asn Leu Lys Ile Leu Thr Asp Ser Tyr Ile Gln Cys Asn Glu Leu Leu
115 120 125

Ser Gln Gly Ser Phe Lys Lys Met Val Ser Pro Tyr Lys Ile Met Cys
130 135 140

Ser Leu Ala Glu Asn Thr Phe Ile Ser Tyr Lys Ser Leu Asp Glu Ile
145 150 155 160

Asn Tyr Leu Leu Ser Ser Ile Ser Arg Leu Lys Gly Asp Thr Leu Ser
165 170 175

Lys Ile Lys Gln Asn Tyr Asn Ala Leu Phe Ser Gly Gly Asn Ile Ser
180 185 190

Glu His Asp Thr Ala Leu Thr Met Glu Leu Arg Glu Gly Ala Cys Glu
195 200 205

Leu Leu Asp Cys Asp Thr Ser Thr Arg Ala Gln Met Ile Asp Trp Cys
210 215 220

Leu Asp Lys Leu Leu Phe Glu Met Lys Glu Ile Phe Arg Val Asp Asp
225 230 235 240

Glu Ala Gly Ser Leu Glu Asn Leu Ser Arg Arg Tyr Ile Tyr Phe Lys
245 250 255

Lys Ile Leu Asn Asn Phe Asn Ser Lys Phe Ala Asp Tyr Phe Leu Lys
260 265 270

Asp Trp Glu Met Ala Val Arg Leu Thr Thr Thr Phe Tyr His Ile Thr
275 280 285

His Lys Asp Leu Gln Thr Leu Leu Lys Arg Glu Phe Lys Asp Lys Asn
290 295 300

Pro Ser Ile Asp Leu Phe Met Thr Ala Leu Gln Ser Thr Leu Asp Phe
305 310 315 320

Glu Lys Tyr Ile Asp Val Arg Phe Ser Lys Lys Ile Lys Glu Pro Lys
325 330 335

Leu Ser Ser Cys Phe Glu Pro Tyr Leu Thr Leu Trp Val Ser His Gln
340 345 350

Asn Gln Met Met Glu Lys Lys Phe Leu Ser Tyr Met Ser Glu Pro Lys
355 360 365

Tyr Pro Ser Asn Glu Thr Glu Ser Leu Val Leu Pro Ser Ser Ala Asp
370 375 380

Leu Phe Arg Thr Tyr Arg Ser Val Leu Thr Gln Thr Leu Glu Leu Ile
385 390 395 400

Asp Asn Asn Ala Asn Asp Ser Ile Leu Thr Ser Leu Ala Asn Phe Phe
405 410 415

Ser Arg Trp Leu Gln Thr Tyr Ser Gln Lys Ile Leu Leu Pro Leu Leu
420 425 430

Leu Pro Asp Asn Ile Glu Val Gln Asp Lys Leu Glu Ala Ala Lys Tyr
435 440 445

Thr Val Leu Leu Ile Asn Thr Ala Asp Tyr Cys Ala Thr Thr Ile Asp
450 455 460

Gln Leu Glu Asp Lys Leu Ser Glu Phe Ser Gly Asn Arg Glu Lys Leu
465 470 475 480

Ala Asn Ser Phe Thr Lys Thr Lys Asn Ile Tyr Asp Asp Leu Leu Ala
485 490 495

Lys Gly Thr Ser Phe Leu Leu Asn Arg Val Ile Pro Leu Asp Leu Asn
500 505 510

Phe Val Trp Arg Glu Phe Ile Asn Asn Asp Trp Ser Asn Ala Ala Ile
515 520 525

Glu Asp Tyr Ser Arg Tyr Met Val Thr Leu Lys Ser Val Leu Lys Met
530 535 540

Pro Ala Leu Thr Asp Ala Ser Ile Lys Gln Gln Gln Glu Gln Pro Ser
545 550 555 560

Thr Leu Ala Phe Ile Leu Ser Gln Phe Asn Arg Asp Val Tyr Lys Trp
565 570 575

Asn Phe Leu Asp Lys Val Ile Asp Ile Ile Thr Thr Asn Phe Val Ser
580 585 590

Asn Thr Ile Arg Leu Leu Gln Pro Val Pro Pro Phe Ser Leu Ala Gly
595 600 605

Ser Lys Arg Lys Phe Glu Thr Arg Thr Val Val Asn Ile Gly Glu Gln
610 615 620

Leu Leu Leu Asp Leu Glu Leu Leu Lys Glu Ile Phe His Thr Leu Pro
625 630 635 640

Glu Ser Val Ser Asn Asp Ser Asp Leu Arg Glu Asn Thr Ser Tyr Lys
645 650 655

Arg Val Lys Arg His Ala Asp Asn Asn Ile Asp Gln Leu Leu Lys Phe
660 665 670

Ile Lys Leu Leu Met Ala Pro Leu Asp Ser Ala Asp Asp Tyr Tyr Glu
675 680 685

Thr Tyr Ser Lys Leu Thr Asn Asn Asn Pro Asp Ser Ala Val Trp Ser
690 695 700

Phe Val Leu Ala Leu Lys Gly Ile Pro Trp Asp Leu Ala Leu Trp Lys
705 710 715 720

Lys Leu Trp Ser Ala Tyr Asn Leu Glu Thr Asp Asp Thr Asp Glu Gly
725 730 735

Ser Arg Pro Asp Ser Asn Arg Asp Leu Phe Ile Phe Lys Trp Asp Lys
740 745 750

Val Leu Leu Gly Gln Phe Glu Asn Asn Leu Ala Arg Met Gln Asp Pro
755 760 765

Asn Trp Ser Lys Phe Val Arg Gln Asp Leu Lys Ile Ser Pro Pro Val
770 775 780

Met Lys Arg Ile Val Ser Thr Pro Gln Ile Gln Gln Gln Lys Glu Glu
785 790 795 800

Gln Lys Lys Gln Ser Leu Ser Val Lys Asp Phe Val Ser His Ser Arg
805 810 815

Phe Phe Asn Arg Gly Thr
820



<210> 74

<211> 2000
<212> DNA

<213> Saccharomyces cerevisiae
<400> 74

atgctggaag gtacggtaga ttatgacccg ctggaagata ttaccaatat acttttttca 60

aaagaatccc tgaacaacat agatgaactg atcagtatta ccagaagcta caaaaagcaa 120

ttgcaagagg atattctcaa agaagagaat gaattgaagg aacaccctaa aaattccgct 180

gaaatagagg cttctctgag gaaagttttc caagatttca aagaaactca agatgtctca 240

gcctccaccg agttgacgat atcgaatctg acagaaggta tctcgtacct ggacattgcc 300

aagaaaaacc tcacccactc tttgactctt ttccaaaatt taaagatatt gacagacagt 360

tacatacaat gcaatgaatt actctcacag ggctcattca aaaaaatggt gtccccttat 420

aagataatgt gttcgcttgc tgaaaacaca ttcatctctt acaaatcatt ggacgagata 480

aactatttgt tgagctccat ttcaagactg aaaggagaca ctttgtccaa aattaaacaa 540

aactacaatg cgctcttttc cggcggcaat atctcagagc atgatacagc actcactatg 600

gaattgcgcg aaggtgcctg cgagctactc gactgcgata caagtacgag agcccagatg 660

atagattggt gtttggacaa acttctcttc gaaatgaaag agatatttag ggtcgacgat 720

gaagccggat ccctagaaaa tttatcgaga agatacattt acttcaaaaa aattcttaat 780

aacttcaatt caaagttcgc agactatttc ttaaaagact gggaaatggc agtcagattg 840

accaccactt tttatcacat tacacacaag gaccttcaga cacttctgaa aagggaattc 900

aaagacaaga acccttccat tgatctattc atgacagcat tacaatcgac gctagatttc 960

gaaaaataca tcgacgtacg attttcaaaa aaaattaagg aaccaaaact aagttcctgc 1020

ttcgaacctt atttgacttt atgggtgtct caccaaaacc aaatgatgga aaagaaattt 1080

ctttcttata tgagtgagcc gaagtaccca tctaatgaaa cagaatctct cgtgttaccc 1140

tcgagtgcag accttttcag gacatatcgt tccgtactga ctcagacctt agagctcatt 1200

gataataatg ccaatgatag catattgact tcattggcaa attttttcag tagatggctt 1260

caaacttact cacaaaaaat tcttcttcct ttactgctgc ccgacaatat tgaagtccag 1320

gataagctag aagctgccaa gtataccgtt ttattgatca atactgcaga ttattgtgcc 1380

acgactatag atcaattgga ggataaatta tctgaattca gcggtaatcg tgaaaagctg 1440

gcaaacagtt ttacgaaaac gaaaaatata tacgacgatt tactagcaaa aggaacttct 1500

tttctattaa accgtgtcat acccttagat ctaaattttg tatggagaga gtttatcaac 1560

aatgattggt caaatgctgc gatagaagat tatagcaggt acatggtaac cctcaaatcc 1620

gtacttaaaa tgcccgcatt aacagatgcc tctattaaac aacagcaaga gcaaccttcg 1680

actttggcat ttattttgtc gcaattcaat agagatgttt ataagtggaa tttcttggat 1740

aaggtgattg atatcatcac tacaaatttt gtaagcaata ccatccgcct tctgcagccc 1800

gttccaccct tttccctggc gggcagcaaa aggaaatttg aaaccagaac tgttgtcaac 1860

attggcgagc agcttctcct tgatttagaa ttgctgaagg agatttttca cactttacca 1920

gaaagtgtaa gtaacgattc tgacttgcga gaaaatacct cttataagag ggtgaaaaga 1980

catgcagaca ataatataga 2000



<210> 75

<211> 779

<212> PRT

<213> Saccharomyces cerevisiae
<400> 75

Met Glu Arg Thr Asn Thr Thr Thr Phe Lys Phe Phe Ser Leu Gly Gly
1 5 10 15

Ser Asn Glu Val Gly Arg Ser Cys His Ile Leu Gln Tyr Lys Gly Lys
20 25 30

Thr Val Met Leu Asp Ala Gly Ile His Pro Ala Tyr Gln Gly Leu Ala
35 40 45

Ser Leu Pro Phe Tyr Asp Glu Phe Asp Leu Ser Lys Val Asp Ile Leu
50 55 60

Leu Ile Ser His Phe His Leu Asp His Ala Ala Ser Leu Pro Tyr Val
65 70 75 80

Met Gln Arg Thr Asn Phe Gln Gly Arg Val Phe Met Thr His Pro Thr
85 90 95

Lys Ala Ile Tyr Arg Trp Leu Leu Arg Asp Phe Val Arg Val Thr Ser
100 105 110

Ile Gly Ser Ser Ser Ser Ser Met Gly Thr Lys Asp Glu Gly Leu Phe
115 120 125

Ser Asp Glu Asp Leu Val Asp Ser Phe Asp Lys Ile Glu Thr Val Asp
130 135 140

Tyr His Ser Thr Val Asp Val Asn Gly Ile Lys Phe Thr Ala Phe His
145 150 155 160

Ala Gly His Val Leu Gly Ala Ala Met Phe Gln Ile Glu Ile Ala Gly
165 170 175

Leu Arg Val Leu Phe Thr Gly Asp Tyr Ser Arg Glu Val Asp Arg His
180 185 190

Leu Asn Ser Ala Glu Val Pro Pro Leu Ser Ser Asn Val Leu Ile Val
195 200 205

Glu Ser Thr Phe Gly Thr Ala Thr His Glu Pro Arg Leu Asn Arg Glu
210 215 220

Arg Lys Leu Thr Gln Leu Ile His Ser Thr Val Met Arg Gly Gly Arg
225 230 235 240

Val Leu Leu Pro Val Phe Ala Leu Gly Arg Ala Gln Glu Ile Met Leu
245 250 255

Ile Leu Asp Glu Tyr Trp Ser Gln His Ala Asp Glu Leu Gly Gly Gly
260 265 270

Gln Val Pro Ile Phe Tyr Ala Ser Asn Leu Ala Lys Lys Cys Met Ser
275 280 285

Val Phe Gln Thr Tyr Val Asn Met Met Asn Asp Asp Ile Arg Lys Lys
290 295 300

Phe Arg Asp Ser Gln Thr Asn Pro Phe Ile Phe Lys Asn Ile Ser Tyr
305 310 315 320

Leu Arg Asn Leu Glu Asp Phe Gln Asp Phe Gly Pro Ser Val Met Leu
325 330 335

Ala Ser Pro Gly Met Leu Gln Ser Gly Leu Ser Arg Asp Leu Leu Glu
340 345 350

Arg Trp Cys Pro Glu Asp Lys Asn Leu Val Leu Ile Thr Gly Tyr Ser
355 360 365

Ile Glu Gly Thr Met Ala Lys Phe Ile Met Leu Glu Pro Asp Thr Ile
370 375 380

Pro Ser Ile Asn Asn Pro Glu Ile Thr Ile Pro Arg Arg Cys Gln Val
385 390 395 400

Glu Glu Ile Ser Phe Ala Ala His Val Asp Phe Gln Glu Asn Leu Glu
405 410 415

Phe Ile Glu Lys Ile Ser Ala Pro Asn Ile Ile Leu Val His Gly Glu
420 425 430

Ala Asn Pro Met Gly Arg Leu Lys Ser Ala Leu Leu Ser Asn Phe Ala
435 440 445

Ser Leu Lys Gly Thr Asp Asn Glu Val His Val Phe Asn Pro Arg Asn
450 455 460

Cys Val Glu Val Asp Leu Glu Phe Gln Gly Val Lys Val Ala Lys Ala
465 470 475 480

Val Gly Asn Ile Val Asn Glu Ile Tyr Lys Glu Glu Asn Val Glu Ile
485 490 495

Lys Glu Glu Ile Ala Ala Lys Ile Glu Pro Ile Lys Glu Glu Asn Glu
500 505 510

Asp Asn Leu Asp Ser Gln Ala Glu Lys Gly Leu Val Asp Glu Glu Glu
515 520 525

His Lys Asp Ile Val Val Ser Gly Ile Leu Val Ser Asp Asp Lys Asn
530 535 540

Phe Glu Leu Asp Phe Leu Ser Leu Ser Asp Leu Arg Glu His His Pro
545 550 555 560

Asp Leu Ser Thr Thr Ile Leu Arg Glu Arg Gln Ser Val Arg Val Asn
565 570 575

Cys Lys Lys Glu Leu Ile Tyr Trp His Ile Leu Gln Met Phe Gly Glu
580 585 590

Ala Glu Val Leu Gln Asp Asp Asp Arg Val Thr Asn Gln Glu Pro Lys
595 600 605

Val Lys Glu Glu Ser Lys Asp Asn Leu Thr Asn Thr Gly Lys Leu Ile
610 615 620

Leu Gln Ile Met Gly Asp Ile Lys Leu Thr Ile Val Asn Thr Leu Ala
625 630 635 640

Val Val Glu Trp Thr Gln Asp Leu Met Asn Asp Thr Val Ala Asp Ser
645 650 655

Ile Ile Ala Ile Leu Met Asn Val Asp Ser Ala Pro Ala Ser Val Lys
660 665 670

Leu Ser Ser His Ser Cys Asp Asp His Asp His Asn Asn Val Gln Ser
675 680 685

Asn Ala Gln Gly Lys Ile Asp Glu Val Glu Arg Val Lys Gln Ile Ser
690 695 700

Arg Leu Phe Lys Glu Gln Phe Gly Asp Cys Phe Thr Leu Phe Leu Asn
705 710 715 720

Lys Asp Glu Tyr Ala Ser Asn Lys Glu Glu Thr Ile Thr Gly Val Val
725 730 735

Thr Ile Gly Lys Ser Thr Ala Lys Ile Asp Phe Asn Asn Met Lys Ile
740 745 750

Leu Glu Cys Asn Ser Asn Pro Leu Lys Gly Arg Val Glu Ser Leu Leu
755 760 765

Asn Ile Gly Gly Asn Leu Val Thr Pro Leu Cys
770 775



<210> 76

<211> 2000

<212> DNA

<213> Saccharomyces cerevisiae
<400> 76

atggagcgaa caaatacaac aacattcaaa tttttttcat tgggaggaag taacgaagtt 60

ggacgatcat gtcatatatt gcaatataaa ggtaaaacag tgatgctcga cgcaggaatt 120

catccggcat atcaaggatt agcttcatta cctttttacg atgaatttga tctttccaaa 180

gtcgatatct tactgatatc acatttccat ttagaccatg ccgcttcact tccgtatgtg 240

atgcaacgga ctaactttca aggcagagtt ttcatgacac atccaaccaa agccatttac 300

agatggctac tgcgagattt tgtaagagtt actagtatag gttcttcatc ctcctctatg 360

gggactaaag acgaaggtct attctcagat gaggatttag ttgattcttt cgataaaatt 420

gaaacggtgg actatcattc tactgttgac gtcaatggta tcaaatttac ggcattccat 480

gcaggccatg tattgggtgc agcgatgttt caaatagaga ttgctggcct tagggtgtta 540

ttcacaggtg actattcgag agaagtagat cgtcacttaa attctgctga ggtgcctcca 600

ctttcatcca acgtattaat tgtggaatct acctttggta ccgctactca cgagccccgt 660

ttaaatagag aaaggaagct aacccaacta atccattcca cagtgatgag aggaggccgt 720

gttctactgc ctgtttttgc tttagggaga gctcaagaaa tcatgcttat actggatgag 780

tactggtctc aacatgctga tgaactcggt ggtggacaag tcccaatatt ttatgcatca 840

aatttggcaa aaaaatgtat gagcgttttt caaacctatg taaatatgat gaatgatgac 900

attagaaaaa aatttagaga ctctcagact aatcccttca tattcaaaaa tatatcttat 960

cttagaaact tggaggattt ccaagatttt ggtcccagtg tgatgttggc ctcaccaggt 1020

atgctgcaaa gtgggttatc aagagattta cttgaaaggt ggtgccctga agataaaaat 1080

ctagtactaa tcaccggtta ctccatcgaa ggaacaatgg cgaaatttat tatgcttgag 1140

ccagatacaa taccttccat aaataatccg gaaataacca ttccaagacg ttgtcaagtt 1200

gaagaaatct cctttgccgc acacgttgac ttccaggaaa atttagaatt tattgaaaag 1260

attagtgcgc caaatatcat ccttgttcat ggagaagcca atcccatggg ccgtttgaaa 1320

tctgcattgt tatccaattt cgcgtcttta aagggtacag ataatgaagt ccatgttttt 1380

aatcctagaa actgtgttga agtagatctt gaatttcaag gtgtcaaggt tgcaaaagct 1440

gtgggaaata ttgtgaacga aatatataaa gaagaaaacg tagagataaa ggaagaaatt 1500

gcggctaaaa ttgaacctat aaaggaagaa aacgaagaca acttggattc tcaagcagaa 1560

aaaggtttgg ttgatgaaga ggaacacaaa gacatagtcg tttctgggat tctagtttca 1620

gatgacaaaa atttcgaatt agacttcctt tctttgtctg atttaagaga gcaccatccc 1680

gatctttcta caacgatatt aagagagcgc cagtcagttc gtgtaaattg taaaaaggag 1740

ctaatttact ggcacatttt acaaatgttt ggagaggctg aagttcttca agatgatgat 1800

agagtaacaa atcaagaacc aaaggttaaa gaagagtcaa aagacaatct gaccaataca 1860

ggtaaattga ttctacagat aatgggtgat attaagttaa ctattgttaa tactctagcc 1920

gttgtggaat ggactcaaga tttaatgaat gacactgtag cagactccat tattgctata 1980

cttatgaatg tggattcagc 2000



<210> 77 

<211> 208 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 77 

Met Ser Leu Ile His Pro Asp Thr Ala Lys Tyr Pro Phe Lys Phe Glu
1 5 10 15

Pro Phe Leu Arg Gln Glu Tyr Ser Phe Ser Leu Asp Pro Asp Arg Pro
20 25 30

Ile Cys Glu Phe Tyr Asn Ser Arg Glu Gly Pro Lys Ser Cys Pro Arg
35 40 45

Gly Pro Leu Cys Pro Lys Lys His Val Leu Pro Ile Phe Gln Asn Lys
50 55 60

Ile Val Cys Arg His Trp Leu Arg Gly Leu Cys Lys Lys Asn Asp Gln
65 70 75 80

Cys Glu Tyr Leu His Glu Tyr Asn Leu Arg Lys Met Pro Glu Cys Val
85 90 95

Phe Phe Ser Lys Asn Gly Tyr Cys Thr Gln Ser Pro Asp Cys Gln Tyr
100 105 110

Leu His Ile Asp Pro Ala Ser Lys Ile Pro Lys Cys Glu Asn Tyr Glu
115 120 125

Met Gly Phe Cys Pro Leu Gly Ser Ser Cys Pro Arg Arg His Ile Lys
130 135 140

Lys Val Phe Cys Gln Arg Tyr Met Thr Gly Phe Cys Pro Leu Gly Lys
145 150 155 160

Asp Glu Cys Asp Met Glu His Pro Gln Phe Ile Ile Pro Asp Glu Gly
165 170 175

Ser Lys Leu Arg Ile Lys Arg Asp Asp Glu Ile Asn Thr Arg Lys Met
180 185 190

Asp Glu Glu Lys Glu Arg Arg Leu Asn Ala Ile Ile Asn Gly Glu Val
195 200 205



<210> 78 

<211> 627 

<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 78 

atgagcctaa ttcaccccga tacagcaaaa tatcctttta aatttgaacc tttcctcagg 60 

caagagtatt cgttttcact cgatcctgac agacctattt gtgaatttta caattctaga 120 

gaaggcccta aatcatgtcc gaggggaccg ttatgtccaa aaaagcatgt gttaccaata 180 

tttcagaata aaattgtttg tagacattgg cttcgagggt tgtgcaaaaa gaatgaccaa 240 
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tgtgaatact tacatgaata 
aacgggtact gtacacaaag 
ataccaaaat gtgaaaatta 
cggcatatta agaaggtttt 
gatgaatgtg atatggaaca 
attaagagag acgatgagat 
aacgcaatta taaacggtga 



caatcttcga aaaatgcctg 
tccagattgt caatatctac 
cgaaatggga ttctgtcctc 
ctgtcaaaga tacatgaccg 
tccacagttc ataatcccag 
aaataccagg aaaatggatg 
agtttga 



aatgtgtctt cttcagcaaa 
acatagatcc cgctagcaag 
ttgtcctaga 
gattttgtcc tttagggaag 
atgaaggtag taaattaaga 
aagaaaagga aaggcgttta 



Cys Pro Glu Glu Thr 
15 

Gly Phe Asp Lys Met 
30 

Thr Phe Ser Gin Gly 
45 

Ala Ser Ser Glu Lys 
60 

Lys Met Lys Thr Tyr 
80 

Gly Glu Gly Glu Asp 
95 



aagaaacgtt aaacgaagaa 
ccctgttgcg gtctcgagaa 
gccaatcagt gataaatgca 



<210> 79 

<211> 107 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 79 

Met Val Lys Gly Lys Thr Phe Leu Lys Arg lie 
15 10 



Leu Asn Glu Glu Thr Lys Gin Glu Val Ser Val 
20 25 



Arg Thr Leu Leu Arg Ser Arg Glu Ser Gly Met 
35 40 



Pro Lys Leu Ala Ser Cys Gin Ser Val lie Asn 
50 55 



Thr Ala Trp Thr Gin Leu Val Phe Arg Lys Ser 
65 70 75 



Thr Lys Ser Val His Val He Phe lie Ala Met 
85 90 



Glu Ser Val Asp Met Asn Val Gly lie Ser Tyr 





100 


<210> 


80 


<211> 


324 


<212> 


DNA 


<213> 


Saccharomyces cerevisiae 


<400> 


80 



atggtaaagg gtaaaacgtt tctgaaaaga atctgtccgg 
actaagcagg aagtttcggt agggttcgat aagatgagaa 
tcagggatga ctttctccca aggacctaag ttagccagtt 



300 

360 

420 

480 

540 

600 

627 



60 

120 



180 
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tcatctgaaa aaacggcttg gacacaactc gtgtttagga agagtaaaat gaagacgtac 240 
accaagtctg tacacgttat cttcattgct atgggggaag gggaggatga aagtgttgat 300 
atgaatgtag gtattagtta ttaa 324 



<210> 81 
<211> 425 

<212> PRT 

<213> Saccharomyces cerevisiae
<400> 81 

Met Thr Asp Pro Arg Arg Arg Thr Gly Arg His Phe Leu Thr Pro Glu
1 5 10 15

Asn Leu Ser Ser Thr Leu Gln Ile Thr Asn Leu Pro Pro Glu Trp Asn
20 25 30

Gln Asp Ile Ile Thr Ser Val Val Ala Gly Ser Gly Pro Val Ile Asp
35 40 45

Ile Lys Ala Lys Asn Asp Pro Arg Thr Gly Lys Leu Thr Gly Val Leu
50 55 60

Phe Asp Tyr Leu Thr Ser Lys Asp Cys Lys Arg Ala Trp Glu Ile Leu
65 70 75 80

Asn Arg Ile Glu Asn Phe Pro Val Lys Ile Glu Gln Ile Ile Pro Pro
85 90 95

Asn Tyr Lys Asp His Leu Arg Glu Thr Ala Asn Lys Asn Ser Gln Lys
100 105 110

Gln Val Leu Gln Leu Asn Arg Asp Ser Tyr Pro Phe Glu Ala Gly Leu
115 120 125

Glu Leu Pro Phe Glu Met Val Thr Glu Val Pro Ile Pro Arg Arg Pro
130 135 140

Pro Pro Pro Gln Ala Ala Asn Asn Thr Asn Ser Val Ser Asn Asn Thr
145 150 155 160

Asn Ile Gln Phe Pro Asp Ile Leu Ser Lys Ala Ser Lys His Leu Pro
165 170 175

Ser Phe Gln Asp Gly Ser Ile Ile Ala Pro Asp Lys Ile Ser Gln Asn
180 185 190

Leu Ser Lys Ile Pro Pro Leu Gln Leu Ile Glu Ile Ile Ser Asn Leu
195 200 205

Lys Ile Leu Ser Asn Gln Glu Asn Ile Gln Lys Ser Gln Leu Glu Ser
210 215 220

Phe Leu Asp Thr Asn Ser Asp Ile Thr Ile Ser Val Thr Gln Ala Leu
225 230 235 240

Leu Glu Met Gly Phe Ile Asp Tyr Ser Val Val Thr Lys Val Leu Lys
245 250 255

Ser Gln Val Gly Glu Ala Pro Ser Leu Leu Ser Ser Asn Asn Thr Ser
260 265 270

Asn Ser Asn Thr Pro Val Ser Val Ile Arg Asn Asn Thr Pro Leu His
275 280 285

Val Pro Ser Asn Glu Val Ser Asn Asn Pro Asn Asn Met Pro Leu Asn
290 295 300

Val Ala Met Pro Met Pro Met Ser Thr Pro Pro Phe Ile Pro Leu Pro
305 310 315 320

Leu Gln Gln Gln Pro Phe Gly Phe Ala Pro Pro Gly Pro Phe Met Pro
325 330 335

Pro Ala Gln Gly Pro Ser Met Gly Gln Pro Val Leu Ala Asn Gln Leu
340 345 350

Gly Gln Val Gln Gln Gln Asn Ile Ser Ser Thr Glu Gly Pro Ser Asn
355 360 365

Ala Asn Lys Ala Asn Asp Ser Gly Thr Ile Asn Met Ala Lys Leu Gln
370 375 380

Leu Leu Pro Glu Asn Gln Gln Asp Met Ile Lys Gln Val Leu Thr Leu
385 390 395 400

Thr Pro Ala Gln Ile Gln Ser Leu Pro Ser Asp Gln Gln Leu Met Val
405 410 415

Glu Asn Phe Arg Lys Glu Tyr Ile Ile
420 425



<210> 82
<211> 1278
<212> DNA
<213> Saccharomyces cerevisiae
<400> 82



atgacagatc ccagaagaag aacaggccgt catttcttga caccggagaa tttatcctct 60

acattacaaa tcacaaactt acctccagaa tggaaccaag atataattac ttcggtcgtg 120

gccggttctg gtccagttat agatataaaa gctaagaatg acccgagaac tggtaaacta 180

accggtgtac tgttcgatta tttgactagt aaagattgta aacgcgcttg ggaaatttta 240

aatagaattg aaaactttcc cgtaaagata gagcaaataa tcccaccaaa ttataaggac 300

catcttagag aaacagcaaa taaaaattct caaaagcagg tattacaact taatagagat 360

tcgtacccct tcgaggcggg tttggagcta cctttcgaaa tggtgacaga agtccccatt 420

cctaggcgac caccgccacc acaggctgca aataacacaa actctgtatc aaataacaca 480

aacattcaat tccccgacat actaagtaaa gcatctaaac acttgccaag tttccaagat 540

ggctcgatta ttgcaccaga caaaatttca caaaatttaa gtaaaattcc gccgttgcaa 600

cttattgaaa ttatatcaaa tttgaaaata ttatcaaacc aagaaaacat ccaaaaatcg 660

caattagaat ctttcttaga tactaacagt gatatcacaa tatcagtgac ccaagcccta 720

ctagaaatgg gatttataga ctacagcgtg gtgactaaag tgttgaaatc ccaagttggc 780

gaggccccat ctttgctttc gagtaataac acaagtaatt cgaacacccc cgtaagcgta 840

attagaaata acactccgtt gcatgtacct tctaatgaag tcagcaacaa tcctaacaat 900

atgccactga acgtagctat gccaatgcct atgtcgacac caccatttat ccctttacct 960

ctgcaacaac aaccgttcgg ttttgcgcca ccgggccctt tcatgcctcc agctcaaggc 1020

ccctccatgg gacagcctgt gttggcaaat caactcggcc aggtccagca acaaaatata 1080

agttctacag aaggaccctc taacgcgaat aaagcgaatg acagcggcac cattaatatg 1140

gcgaaactgc aattactacc tgaaaaccaa caagatatga tcaaacaagt tcttactttg 1200

acacctgccc agatccaaag tttaccaagt gaccagcaac ttatggtgga aaactttaga 1260

aaagaatata taatctaa 1278



<210> 83 

<211> 1592 

<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 83 

Met Gly Thr Asp Pro Leu Ile Ile Arg Asn Asn Gly Ser Phe Trp Glu
1 5 10 15

Val Asp Asp Phe Thr Arg Leu Gly Arg Thr Gln Leu Leu Ser Tyr Tyr
20 25 30

Leu Pro Leu Ala Ile Ile Ala Ser Ile Gly Ile Phe Ala Leu Cys Arg
35 40 45

Ser Gly Leu Ser Arg Tyr Val Arg Ser Ala Glu Cys Asp Leu Val Asn
50 55 60

Glu Tyr Leu Phe Gly Ala Gln Glu Glu Arg Lys Glu Asp Asn Ser Ile
65 70 75 80

Glu Arg Leu Leu Arg Asn Ser Asn Thr Gln Ala Asn Tyr Val Asn Val
85 90 95

Lys Lys Gln Gly Arg Ile Leu Lys Leu Arg His Phe Asp Ile Thr Thr
100 105 110

Ile Asp Val Lys Gln Ile Asp Ala Lys Asn His Gly Gly Leu Thr Phe
115 120 125

Ser Arg Pro Ser Thr Ser Asp His Leu Arg Lys Ser Ser Glu Ile Val
130 135 140

Leu Met Ser Leu Gln Ile Ile Gly Leu Ser Phe Leu Arg Val Thr Lys
145 150 155 160

Ile Asn Ile Glu Leu Thr Asn Arg Asp Val Thr Thr Leu Leu Leu Phe
165 170 175

Trp Leu Ile Leu Leu Ser Leu Ser Ile Leu Arg Val Tyr Lys Arg Ser
180 185 190

Thr Asn Leu Trp Ala Ile Cys Phe Thr Ala His Thr Thr Ile Trp Ile
195 200 205

Ser Thr Trp Ile Pro Ile Arg Ser Val Tyr Ile Gly Asn Ile Asp Asp
210 215 220

Val Pro Ser Gln Ile Phe Tyr Ile Phe Glu Phe Val Ile Thr Ser Thr
225 230 235 240

Leu Gln Pro Ile Lys Leu Thr Ser Pro Ile Lys Asp Asn Ser Ser Ile
245 250 255

Ile Tyr Val Arg Asp Asp His Thr Ser Pro Ser Arg Glu His Ile Ser
260 265 270

Ser Ile Leu Ser Cys Ile Thr Trp Ser Trp Ile Thr Asn Phe Ile Trp
275 280 285

Glu Ala Gln Lys Asn Thr Ile Lys Leu Lys Asp Ile Trp Gly Leu Ser
290 295 300

Met Glu Asp Tyr Ser Ile Phe Ile Leu Lys Gly Phe Thr Arg Arg Asn
305 310 315 320



Lys His Ile Asn Asn Leu Thr Leu Ala Leu Phe Glu Ser Phe Lys Thr
325 330 335

Tyr Leu Leu Ile Gly Met Leu Trp Val Leu Val Asn Ser Ile Val Asn
340 345 350

Leu Leu Pro Thr Ile Leu Met Lys Arg Phe Leu Glu Ile Val Asp Asn
355 360 365

Pro Asn Arg Ser Ser Ser Cys Met Asn Leu Ala Trp Leu Tyr Ile Ile
370 375 380

Gly Met Phe Ile Cys Arg Leu Thr Leu Ala Ile Cys Asn Ser Gln Gly
385 390 395 400

Gln Phe Val Ser Asp Lys Ile Cys Leu Arg Ile Arg Ala Ile Leu Ile
405 410 415

Gly Glu Ile Tyr Ala Lys Gly Leu Arg Arg Arg Leu Phe Thr Ser Pro
420 425 430

Lys Thr Ser Ser Asp Ser Asp Ser Ile Ser Ala Asn Leu Gly Thr Ile
435 440 445

Ile Asn Leu Ile Ser Ile Asp Ser Phe Lys Val Ser Glu Leu Ala Asn
450 455 460

Tyr Leu Tyr Val Thr Val Gln Ala Val Ile Met Ile Ile Val Val Val
465 470 475 480

Gly Leu Leu Phe Asn Phe Leu Gly Val Ser Ala Phe Ala Gly Ile Ser
485 490 495

Ile Ile Leu Val Met Phe Pro Leu Asn Phe Leu Leu Ala Asn Leu Leu
500 505 510

Gly Lys Phe Gln Lys Gln Thr Leu Lys Cys Thr Asp Gln Arg Ile Ser
515 520 525

Lys Leu Asn Glu Cys Leu Gln Asn Ile Arg Ile Val Lys Tyr Phe Ala
530 535 540

Trp Glu Arg Asn Ile Ile Asn Glu Ile Lys Ser Ile Arg Gln Lys Glu
545 550 555 560

Leu Arg Ser Leu Leu Lys Lys Ser Leu Val Trp Ser Val Thr Ser Phe
565 570 575

Leu Trp Phe Val Thr Pro Thr Leu Val Thr Gly Val Thr Phe Ala Ile
580 585 590

Cys Thr Phe Val Gln His Glu Asp Leu Asn Ala Pro Leu Ala Phe Thr
595 600 605

Thr Leu Ser Leu Phe Thr Leu Leu Lys Thr Pro Leu Asp Gln Leu Ser
610 615 620

Asn Met Leu Ser Phe Ile Asn Gln Ser Lys Val Ser Leu Lys Arg Ile
625 630 635 640

Ser Asp Phe Leu Arg Met Asp Asp Thr Glu Lys Tyr Asn Gln Leu Thr
645 650 655

Ile Ser Pro Asp Lys Asn Lys Ile Glu Phe Lys Asn Ala Thr Leu Thr
660 665 670

Trp Asn Glu Asn Asp Ser Asp Met Asn Ala Phe Lys Leu Cys Gly Leu
675 680 685

Asn Ile Lys Phe Gln Ile Gly Lys Leu Asn Leu Ile Leu Gly Ser Thr
690 695 700

Gly Ser Gly Lys Ser Ala Leu Leu Leu Gly Leu Leu Gly Glu Leu Asn
705 710 715 720

Leu Ile Ser Gly Ser Ile Ile Val Pro Ser Leu Glu Pro Lys His Asp
725 730 735

Leu Ile Pro Asp Cys Glu Gly Leu Thr Asn Ser Phe Ala Tyr Cys Ser
740 745 750

Gln Ser Ala Trp Leu Leu Asn Asp Thr Val Lys Asn Asn Ile Ile Phe
755 760 765

Asp Asn Phe Tyr Asn Glu Asp Arg Tyr Asn Lys Val Ile Asp Ala Cys
770 775 780

Gly Leu Lys Arg Asp Leu Glu Ile Leu Pro Ala Gly Asp Leu Thr Glu
785 790 795 800

Ile Gly Glu Lys Gly Ile Thr Leu Ser Gly Gly Gln Lys Gln Arg Ile
805 810 815

Ser Leu Ala Arg Ala Val Tyr Ser Ser Ala Lys His Val Leu Leu Asp
820 825 830

Asp Cys Leu Ser Ala Val Asp Ser His Thr Ala Val Trp Ile Tyr Glu
835 840 845

Asn Cys Ile Thr Gly Pro Leu Met Lys Asn Arg Thr Cys Ile Leu Val
850 855 860

Thr His Asn Val Ser Leu Thr Leu Arg Asn Ala His Phe Ala Ile Val
865 870 875 880

Leu Glu Asn Gly Lys Val Lys Asn Gln Gly Thr Ile Thr Glu Leu Gln
885 890 895

Ser Lys Gly Leu Phe Lys Glu Lys Tyr Val Gln Leu Ser Ser Arg Asp
900 905 910

Ser Ile Asn Glu Lys Asn Ala Asn Arg Leu Lys Ala Pro Arg Lys Asn
915 920 925

Asp Ser Gln Lys Ile Glu Pro Val Thr Glu Asn Ile Asn Phe Asp Ala
930 935 940

Asn Phe Val Asn Asp Gly Gln Leu Ile Glu Glu Glu Glu Lys Ser Asn
945 950 955 960

Gly Ala Ile Ser Pro Asp Val Tyr Lys Trp Tyr Leu Lys Phe Phe Gly
965 970 975

Gly Phe Lys Ala Leu Thr Ala Leu Phe Ala Leu Tyr Ile Thr Ala Gln
980 985 990

Ile Leu Phe Ile Ser Gln Ser Trp Trp Ile Arg His Trp Val Asn Asp
995 1000 1005

Thr Asn Val Arg Ile Asn Ala Pro Gly Phe Ala Met Asp Thr Leu
1010 1015 1020

Pro Leu Lys Gly Met Thr Asp Ser Ser Lys Asn Lys His Asn Ala
1025 1030 1035

Phe Tyr Tyr Leu Thr Val Tyr Phe Leu Ile Gly Ile Ile Gln Ala
1040 1045 1050

Met Leu Gly Gly Phe Lys Thr Met Met Thr Phe Leu Ser Gly Met
1055 1060 1065

Arg Ala Ser Arg Lys Ile Phe Asn Asn Leu Leu Asp Leu Val Leu
1070 1075 1080

His Ala Gln Ile Arg Phe Phe Asp Val Thr Pro Val Gly Arg Ile
1085 1090 1095

Met Asn Arg Phe Ser Lys Asp Ile Glu Gly Val Asp Gln Glu Leu
1100 1105 1110

Ile Pro Tyr Leu Glu Val Thr Ile Phe Cys Leu Ile Gln Cys Ala
1115 1120 1125

Ser Ile Ile Phe Leu Ile Thr Val Ile Thr Pro Arg Phe Leu Thr
1130 1135 1140

Val Ala Val Ile Val Phe Val Leu Tyr Phe Phe Val Gly Lys Trp
1145 1150 1155

Tyr Leu Thr Ala Ser Arg Glu Leu Lys Arg Leu Asp Ser Ile Thr
1160 1165 1170

Lys Ser Pro Ile Phe Gln His Phe Ser Glu Thr Leu Val Gly Val
1175 1180 1185

Cys Thr Ile Arg Ala Phe Gly Asp Glu Arg Arg Phe Ile Leu Glu
1190 1195 1200

Asn Met Asn Lys Ile Asp Gln Asn Asn Arg Ala Phe Phe Tyr Leu
1205 1210 1215

Ser Val Thr Val Lys Trp Phe Ser Phe Arg Val Asp Met Ile Gly
1220 1225 1230

Ala Phe Ile Val Leu Ala Ser Gly Ser Phe Ile Leu Leu Asn Ile
1235 1240 1245

Ala Asn Ile Asp Ser Gly Leu Ala Gly Ile Ser Leu Thr Tyr Ala
1250 1255 1260

Ile Leu Phe Thr Asp Gly Ala Leu Trp Leu Val Arg Leu Tyr Ser
1265 1270 1275

Thr Phe Glu Met Asn Met Asn Ser Val Glu Arg Leu Lys Glu Tyr
1280 1285 1290

Ser Ser Ile Glu Gln Glu Asn Tyr Leu Gly His Asp Glu Gly Arg
1295 1300 1305

Ile Leu Leu Leu Asn Glu Pro Ser Trp Pro Lys Asp Gly Glu Ile
1310 1315 1320

Glu Ile Glu Asn Leu Ser Leu Arg Tyr Ala Pro Asn Leu Pro Pro
1325 1330 1335

Val Ile Arg Asn Val Ser Phe Lys Val Asp Pro Gln Ser Lys Ile
1340 1345 1350

Gly Ile Val Gly Arg Thr Gly Ala Gly Lys Ser Thr Ile Ile Thr
1355 1360 1365

Ala Leu Phe Arg Leu Leu Glu Pro Ile Thr Gly Cys Ile Lys Ile
1370 1375 1380

Asp Gly Gln Asp Ile Ser Lys Ile Asp Leu Val Thr Leu Arg Arg
1385 1390 1395

Ser Ile Thr Ile Ile Pro Gln Asp Pro Ile Leu Phe Ala Gly Thr
1400 1405 1410

Ile Lys Ser Asn Val Asp Pro Tyr Asp Glu Tyr Asp Glu Lys Lys
1415 1420 1425

Ile Phe Lys Ala Leu Ser Gln Val Asn Leu Ile Ser Ser His Glu
1430 1435 1440

Phe Glu Glu Val Leu Asn Ser Glu Glu Arg Phe Asn Ser Thr His
1445 1450 1455

Asn Lys Phe Leu Asn Leu His Thr Glu Ile Ala Glu Gly Gly Leu
1460 1465 1470

Asn Leu Ser Gln Gly Glu Arg Gln Leu Leu Phe Ile Ala Arg Ser
1475 1480 1485

Leu Leu Arg Glu Pro Lys Ile Ile Leu Leu Asp Glu Ala Thr Ser
1490 1495 1500

Ser Ile Asp Tyr Asp Ser Asp His Leu Ile Gln Gly Ile Ile Arg
1505 1510 1515

Ser Glu Phe Asn Lys Ser Thr Ile Leu Thr Ile Ala His Arg Leu
1520 1525 1530

Arg Ser Val Ile Asp Tyr Asp Arg Ile Ile Val Met Asp Ala Gly
1535 1540 1545

Glu Val Lys Glu Tyr Asp Arg Pro Ser Glu Leu Leu Lys Asp Glu
1550 1555 1560

Arg Gly Ile Phe Tyr Ser Met Cys Arg Asp Ser Gly Gly Leu Glu
1565 1570 1575

Leu Leu Lys Gln Ile Ala Lys Gln Ser Ser Lys Met Met Lys
1580 1585 1590

<210> 84
<211> 2000
<212> DNA
<213> Saccharomyces cerevisiae
<400> 84



atgggaacgg atccccttat tatccgaaat aatggttcat tttgggaagt tgatgatttt 60

actcgtttag gaagaactca gctattgagc tactatttac cattggctat catagcctca 120

attggcattt tcgcactttg tcgcagtgga ttatctcgtt atgtaagatc tgccgagtgc 180

gatttagtga acgaatatct atttggcgca caagaagaga gaaaagaaga taatagtata 240

gaaagacttc tacggaactc aaatacccaa gccaattacg tcaacgtcaa aaagcaagga 300

aggattttga aacttagaca ttttgatata acaactatag atgtcaagca aatcgatgct 360

aaaaatcatg gtggactaac gtttagtaga ccgtctacta gtgaccactt aagaaaatca 420

tctgaaattg tattaatgtc tttacaaata attggccttt cctttttaag agtaacaaaa 480

atcaatattg aattaacgaa cagagatgtt acaactttac tattattttg gttaatacta 540

ctttccctaa gtatcttaag agtttacaag cgttcaacga atctttgggc catctgtttt 600

actgcccata caactatttg gatttcaacc tggattccaa ttcgttcggt ctatattggt 660

aatatcgatg atgtaccctc acagatattt tacatctttg aattcgtaat tacttcaacc 720

ttacagccaa taaagctcac ttcaccgatt aaagacaact catccatcat ctacgttaga 780

gacgaccata cgtctccttc gagggaacac atatcctcaa ttttaagttg cattacttgg 840

agctggatta ccaattttat atgggaggcc caaaagaaca ctattaagtt aaaggatatt 900

tggggcttat caatggaaga ctatagcatt ttcattctaa aagggtttac caggagaaac 960

aagcacatta ataatttgac gctagcactg tttgaatctt tcaaaacata tttactcata 1020

ggaatgttat gggttctggt gaacagtatt gtaaaccttc ttcccacaat tttaatgaaa 1080

agatttttag aaattgtgga taacccaaac cgttcctcat catgcatgaa tttggcgtgg 1140

ctttatatta ttggtatgtt catttgtaga ttgacattag caatttgtaa ttcccaaggt 1200

caatttgttt ctgataagat ttgtttaaga ataagagcca tactcatagg agaaatttat 1260

gcaaaaggct tacgtaggag gctgtttaca tctccaaaaa ccagctctga ttcagatagt 1320

atctccgcaa accttggtac cataattaat ctcatttcta ttgactcatt taaggtatcg 1380

gaactagcaa actaccttta tgtgacagtt caggcagtaa ttatgataat agttgttgta 1440

ggactacttt tcaacttttt aggtgtttca gcttttgcag gaatttcaat tatcttagtg 1500

atgttcccat tgaatttctt gttagcgaat ttgttaggta agtttcaaaa gcaaacactg 1560

aaatgtactg accaaagaat ctcaaaattg aacgagtgct tacagaacat aagaattgtc 1620

aaatattttg cttgggaaag gaatattata aatgaaatca aatcaataag gcaaaaggaa 1680

ttaagatcct tattaaaaaa atctttggtg tggtccgtaa cttcttttct ttggttcgtg 1740

acaccgacct tggtgacagg tgtcactttc gccatctgta catttgttca acatgaagat 1800

ttgaatgccc cgcttgcttt cactactttg tcactcttca ctttgttaaa gacacccctg 1860

gatcaattat caaatatgct aagtttcata aatcaatcaa aagtctctct aaaaagaata 1920

agcgattttt taaggatgga cgatacagaa aaatataatc aactaaccat atctccagac 1980

aaaaataaaa ttgaatttaa 2000



<210> 85 

<211> 329 

<212> PRT 

<213> Saccharomyces cerevisiae
<400> 85 

Met Thr Thr Val Ser Ile Asn Lys Pro Asn Leu Leu Lys Phe Lys His
1 5 10 15

Val Lys Ser Phe Gln Pro Gln Glu Lys Asp Cys Gly Pro Val Thr Ser
20 25 30

Leu Asn Phe Asp Asp Asn Gly Gln Phe Leu Leu Thr Ser Ser Ser Asn
35 40 45

Asp Thr Met Gln Leu Tyr Ser Ala Thr Asn Cys Lys Phe Leu Asp Thr
50 55 60

Ile Ala Ser Lys Lys Tyr Gly Cys His Ser Ala Ile Phe Thr His Ala
65 70 75 80

Gln Asn Glu Cys Ile Tyr Ser Ser Thr Met Lys Asn Phe Asp Ile Lys
85 90 95

Tyr Leu Asn Leu Glu Thr Asn Gln Tyr Leu Arg Tyr Phe Ser Gly His
100 105 110

Gly Ala Leu Val Asn Asp Leu Lys Met Asn Pro Val Asn Asp Thr Phe
115 120 125

Leu Ser Ser Ser Tyr Asp Glu Ser Val Arg Leu Trp Asp Leu Lys Ile
130 135 140

Ser Lys Pro Gln Val Ile Ile Pro Ser Leu Val Pro Asn Cys Ile Ala
145 150 155 160

Tyr Asp Pro Ser Gly Leu Val Phe Ala Leu Gly Asn Pro Glu Asn Phe
165 170 175

Glu Ile Gly Leu Tyr Asn Leu Lys Lys Ile Gln Glu Gly Pro Phe Leu
180 185 190

Ile Ile Lys Ile Asn Asp Ala Thr Phe Ser Gln Trp Asn Lys Leu Glu
195 200 205

Phe Ser Asn Asn Gly Lys Tyr Leu Leu Val Gly Ser Ser Ile Gly Lys
210 215 220

His Leu Ile Phe Asp Ala Phe Thr Gly Gln Gln Leu Phe Glu Leu Ile
225 230 235 240

Gly Thr Arg Ala Phe Pro Met Arg Glu Phe Leu Asp Ser Gly Ser Ala
245 250 255

Cys Phe Thr Pro Asp Gly Glu Phe Val Leu Gly Thr Asp Tyr Asp Gly
260 265 270

Arg Ile Ala Ile Trp Asn His Ser Asp Ser Ile Ser Asn Lys Val Leu
275 280 285

Arg Pro Gln Gly Phe Ile Pro Cys Val Ser His Glu Thr Cys Pro Arg
290 295 300

Ser Ile Ala Phe Asn Pro Lys Tyr Ser Met Phe Val Thr Ala Asp Glu
305 310 315 320

Thr Val Asp Phe Tyr Val Tyr Asp Glu
325



<210> 86
<211> 990
<212> DNA
<213> Saccharomyces cerevisiae
<400> 86



atgaccaccg tgtccatcaa taagcccaac ctgctgaaat tcaagcatgt taaaagcttt 60

caacctcaag aaaaagactg cggacccgta acctcattga atttcgacga taatggccag 120

tttctactga cctcttcttc caacgataca atgcaattgt acagtgccac gaactgcaaa 180

ttcttggaca ctatagcctc taagaaatat ggctgtcact ccgctatctt tacgcacgca 240

caaaacgaat gtatctattc ctctacaatg aaaaattttg acattaaata ccttaatctg 300

gaaacaaacc aatatctaag atatttttcc ggtcatggcg ccctagtgaa tgatttgaag 360

atgaaccccg tgaacgatac gtttctatcg tcgtcatacg atgaatccgt taggctttgg 420

gatttgaaga tctctaaacc gcaagttatt ataccaagtc tcgtaccaaa ttgtatcgca 480

tatgatccaa gtggccttgt attcgcattg gggaacccag agaatttcga aatagggcta 540

tataatctga aaaaaattca ggagggtcct ttcttgataa ttaaaattaa tgatgcgact 600

ttcagtcaat ggaataaatt agaattttct aacaatggaa agtatttatt agttggctcc 660

tcgataggaa agcatttaat ttttgacgca ttcacaggtc aacaattatt cgaactaata 720

ggaacaaggg ccttcccgat gagagaattt ctagattctg gatctgcttg tttcacacca 780

gatggtgaat tcgtccttgg aaccgattat gacggtagga ttgccatttg gaatcattct 840

gattcaataa gtaacaaagt attaaggccg caagggttca ttccctgtgt ttctcatgag 900

acctgcccca ggtcaattgc attcaaccct aaatattcga tgtttgttac cgcagacgaa 960

acagtagatt tttacgtgta cgatgaatga 990



<210> 87
<211> 220
<212> PRT
<213> Saccharomyces cerevisiae
<400> 87

Met Ser Ala Gly Asp Ile Ser Ala Ile Asn Ile Lys Ser Val Lys Lys
1 5 10 15

Asn Arg Arg Arg Lys Lys Arg Arg Thr Ala Asp Val Ser Ser Ser Asp
20 25 30

Ser Ser Ser Ser Asp Pro Ser Ser Glu Ser Glu Lys Glu Glu Ile Gln
35 40 45

Asn Gly Ala Ile Glu Glu His Val Gly Glu Asn Gly Lys Ser Asp His
50 55 60

Val Phe Ser Lys Gly Asn Asp Glu Asp Lys Gln Glu Asp Ile Ala Ile
65 70 75 80

Glu Val Ser Asp Val Glu Leu Thr Asp Glu Glu Ser Lys Asp Leu Lys
85 90 95

Leu Asn Ser Lys Glu Val Ile Asp Asp Leu Thr Lys Ile Ser Leu Ser
100 105 110

Lys Ile Pro Glu Pro Thr Lys Ser Gln Asn Lys Glu Gly Phe Met Asn
115 120 125

Ala Ser Lys Ile Ala Glu Asn Ile Lys Leu Ala Arg Glu Glu Tyr Asn
130 135 140

Glu Leu Ala Glu Asn Phe Val Pro Lys Gly Lys Asp Lys Thr Lys Leu
145 150 155 160

Arg Glu Glu Tyr Leu Asn Leu Leu Phe Glu Asn Tyr Gly Asp Asp Ile
165 170 175

Asn Arg Leu Arg Ala Ala Pro Asp Phe Thr Asn Lys Ser Leu Ser Ile
180 185 190

Leu Ala Asp Ala Leu Gln Glu Gly Ile Gly Met Phe Asp Ile Gly Glu
195 200 205

Leu Glu Leu Val Leu Lys Asn Lys Glu Met Glu Asn
210 215 220



<210> 88
<211> 663
<212> DNA
<213> Saccharomyces cerevisiae
<400> 88

atgtcggcag gtgatatatc agccataaat atcaagtctg tcaaaaaaaa cagaaggagg 60 

aagaagagaa gaacagctga tgtttcatca tcagattctt catcatcgga tccatcatca 120 

gaaagtgaaa aggaggaaat ccaaaatggg gccatcgaag aacacgttgg agaaaatggt 180 

aaaagtgatc atgttttctc aaaaggtaat gacgaagaca aacaagagga cattgcaata 240 

gaagtttcgg atgtcgagct tacagacgaa gaaagcaagg atttgaagtt aaattcaaaa 300 

gaagtgatag acgatttaac caaaatttct ttgagcaaga tcccagagcc tacgaaatct 360 

caaaacaagg agggttttat gaatgcatcg aaaattgccg aaaatatcaa gcttgcgaga 420 

gaagaataca atgaattggc agaaaacttt gtgcccaaag ggaaagacaa gacaaagtta 480 



agggaagaat acttaaattt actttttgaa aactacggtg atgatatcaa tcgtcttaga 540

gctgccccgg atttcacgaa taaatcacta tccattttgg cagatgctct gcaggaaggc 600

ataggaatgt ttgatattgg tgaactagaa ttggtcttga aaaataaaga aatggagaac 660

tga 663

<210> 89
<211> 441
<212> PRT
<213> Saccharomyces cerevisiae
<400> 89

Met Ser Ser Thr Ile Phe Tyr Arg Phe Lys Ser Gln Arg Asn Thr Ser
1 5 10 15

Arg Ile Leu Phe Asp Gly Thr Gly Leu Thr Val Phe Asp Leu Lys Arg
20 25 30

Glu Ile Ile Gln Glu Asn Lys Leu Gly Asp Gly Thr Asp Phe Gln Leu
35 40 45

Lys Ile Tyr Asn Pro Asp Thr Glu Glu Glu Tyr Asp Asp Asp Ala Phe
50 55 60

Val Ile Pro Arg Ser Thr Ser Val Ile Val Lys Arg Ser Pro Ala Ile
65 70 75 80

Lys Ser Phe Ser Val His Ser Arg Leu Lys Gly Asn Val Gly Ala Ala
85 90 95

Ala Leu Gly Asn Ala Thr Arg Tyr Val Thr Gly Arg Pro Arg Val Leu
100 105 110

Gln Lys Arg Gln His Thr Ala Thr Thr Thr Ala Asn Val Ser Gly Thr
115 120 125

Thr Glu Glu Glu Arg Ile Ala Ser Met Phe Ala Thr Gln Glu Asn Gln
130 135 140

Trp Glu Gln Thr Gln Glu Glu Met Ser Ala Ala Thr Pro Val Phe Phe
145 150 155 160

Lys Ser Gln Thr Asn Lys Asn Ser Ala Gln Glu Asn Glu Gly Pro Pro
165 170 175

Pro Pro Gly Tyr Met Cys Tyr Arg Cys Gly Gly Arg Asp His Trp Ile
180 185 190

Lys Asn Cys Pro Thr Asn Ser Asp Pro Asn Phe Glu Gly Lys Arg Ile
195 200 205

Arg Arg Thr Thr Gly Ile Pro Lys Lys Phe Leu Lys Ser Ile Glu Ile
210 215 220

Asp Pro Glu Thr Met Thr Pro Glu Glu Met Ala Gln Arg Lys Ile Met
225 230 235 240

Ile Thr Asp Glu Gly Lys Phe Val Val Gln Val Glu Asp Lys Gln Ser
245 250 255

Trp Glu Asp Tyr Gln Arg Lys Arg Glu Asn Arg Gln Ile Asp Gly Asp
260 265 270

Glu Thr Ile Trp Arg Lys Gly His Phe Lys Asp Leu Pro Asp Asp Leu
275 280 285

Lys Cys Pro Leu Thr Gly Gly Leu Leu Arg Gln Pro Val Lys Thr Ser
290 295 300

Lys Cys Cys Asn Ile Asp Phe Ser Lys Glu Ala Leu Glu Asn Ala Leu
305 310 315 320

Val Glu Ser Asp Phe Val Cys Pro Asn Cys Glu Thr Arg Asp Ile Leu
325 330 335

Leu Asp Ser Leu Val Pro Asp Gln Asp Lys Glu Lys Glu Val Glu Thr
340 345 350

Phe Leu Lys Lys Gln Glu Glu Leu His Gly Ser Ser Lys Asp Gly Asn
355 360 365

Gln Pro Glu Thr Lys Lys Met Lys Leu Met Asp Pro Thr Gly Thr Ala
370 375 380

Gly Leu Asn Asn Asn Thr Ser Leu Pro Thr Ser Val Asn Asn Gly Gly
385 390 395 400

Thr Pro Val Pro Pro Val Pro Leu Pro Phe Gly Ile Pro Pro Phe Pro
405 410 415

Met Phe Pro Met Pro Phe Met Pro Pro Thr Ala Thr Ile Thr Asn Pro
420 425 430

His Gln Ala Asp Ala Ser Pro Lys Lys
435 440

<210> 90
<211> 1326
<212> DNA
<213> Saccharomyces cerevisiae
<400> 90

atgagtagca cgatatttta ccgctttaag tctcaacgaa acacatcaag aattttattt 60 

gatggtaccg gcctgacagt atttgatttg aaaagggaaa ttattcaaga gaacaaacta 120 

ggtgacggca cagatttcca attaaaaatt tacaacccag atacagaaga ggaatacgac 180 

gatgatgcct ttgttatacc tagatctact agtgtcatag taaaaagatc tccagcaatt 240 

aaatcattct ccgtacacag tcgacttaaa gggaatgtgg gagcagcagc tcttgggaac 300 

gcaacaaggt atgttactgg taggccaaga gtgttgcaaa agagacaaca cactgctaca 360 

accactgcta atgttagtgg tacaacggaa gaagaaagaa ttgctagtat gtttgccaca 420 

caagaaaatc aatgggaaca aacgcaagaa gaaatgtctg cagccacacc tgtttttttc 480 

aagtcacaga cgaataagaa ttctgcacaa gaaaacgaag gcccaccgcc accaggttat 540 

atgtgctatc gttgtggggg tagagaccac tggattaaaa attgtccaac taacagcgat 600 

ccaaattttg aaggaaaaag aatcagaaga accacaggta ttccaaagaa gtttttaaaa 660 

tccattgaaa tagatcccga gacaatgaca ccggaagaga tggctcagcg aaagattatg 720 

attacggacg aaggcaagtt cgtggtacaa gttgaagaca aacaatcatg ggaagactac 780 

caaaggaaaa gagagaaccg tcaaattgat ggtgatgaaa ccatttggag aaaaggccat 840 

ttcaaagatc ttcctgacga tttaaaatgt cccttgacag gtggtctttt gaggcagccg 900 

gtaaagacaa gcaagtgctg taacatagat ttctcaaaag aggcgctgga aaatgcactg 960 

gtagagagcg actttgtatg ccccaattgc gaaacccgcg atatccttct cgattcttta 1020 

gtacccgacc aggacaagga aaaggaggtc gaaacgtttt tgaagaaaca agaggaacta 1080 

cacggaagct ctaaagatgg caaccagcca gaaactaaga aaatgaagtt gatggatcca 1140 

actggcaccg ctggcttgaa caacaatacc agccttccaa cttctgtaaa taacggcggt 1200 

acgccagtgc caccagtacc gttacctttc ggtatacctc ctttccccat gtttccaatg 1260 

cccttcatgc ctccaacggc tactatcaca aatcctcatc aagctgacgc aagccctaag 1320 

aaatga 1326 

<210> 91 

<211> 159 

<212> PRT 

<213> Saccharomyces cerevisiae
<400> 91 

Met Ser Arg Met Pro Ser Ser Phe Asp Val Thr Glu Arg Asp Leu Asp
1 5 10 15

Asp Met Thr Phe Gly Glu Arg Ile Ile Tyr His Cys Lys Lys Gln Pro
20 25 30

Leu Val Pro Ile Gly Cys Leu Leu Thr Thr Gly Ala Val Ile Leu Ala
35 40 45

Ala Gln Asn Val Arg Leu Gly Asn Lys Trp Lys Ala Gln Tyr Tyr Phe
50 55 60

Arg Trp Arg Val Gly Leu Gln Ala Ala Thr Leu Val Ala Leu Val Ala
65 70 75 80

Gly Ser Phe Ile Tyr Gly Thr Ser Gly Lys Glu Leu Lys Ala Lys Glu
85 90 95

Glu Gln Leu Lys Glu Lys Ala Lys Met Arg Glu Lys Leu Trp Ile Gln
100 105 110

Glu Leu Glu Arg Arg Glu Glu Glu Thr Glu Ala Arg Arg Lys Arg Ala
115 120 125

Glu Leu Ala Arg Met Lys Thr Leu Glu Asn Glu Glu Glu Ile Lys Asn
130 135 140

Leu Glu Lys Glu Leu Ser Asp Leu Glu Asn Lys Leu Gly Lys Lys
145 150 155



<210> 92 

<211> 480 

<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 92 

atgtcacgca tgccatctag tttcgatgtt acggagaggg atttggatga tatgaccttt 60 

ggcgaaagga ttatatacca ttgtaagaaa cagccattgg tacccattgg gtgcttgctg 120 

actacaggag ctgtcattct ggctgctcaa aatgttcgtc ttggtaataa atggaaagct 180 

cagtactact tccgttggcg tgtgggtcta caagcggcca cactagtcgc actagtcgca 240 

ggttcattta tctatgggac ttctggtaag gaactgaagg cgaaggagga acaattgaag 300 

gagaaagcca agatgagaga aaagttatgg atccaagagc tggagagaag ggaggaagaa 360 

acggaggcaa ggagaaaaag agccgaattg gcaagaatga agacccttga gaacgaagag 420 

gaaatcaaga acttagaaaa ggaactaagc gacctggaaa ataagcttgg aaagaagtaa 480 



<210> 93
<211> 188
<212> PRT
<213> Saccharomyces cerevisiae
<400> 93

Met Asp Leu Pro Lys Asp Lys Ser Asp Arg Thr His Gln Arg Ile Asn
1 5 10 15

Leu Asn Asn Ser Gly Thr Asp Arg Thr Asn Asp Leu Tyr Leu His Ile
20 25 30

Val Gln Thr Phe Gly Cys Ile Glu Thr Thr Ala Thr Glu Asn Ala Thr
35 40 45

Lys Leu Leu Met Leu Gly Asp Val Glu Val Glu Ile Ser Ala Ser Ser
50 55 60

Val Ser Ile Glu Trp Thr Gln Lys Ser Met Ile Ser Gln Thr Ile Ala
65 70 75 80

Asp Ser Ile Val Ile Met Ile Ile Gly Leu Cys Ala Ser Asp Lys Asn
85 90 95

Val Leu Ser Glu Ser Glu Leu Lys Glu Arg Asn His Asn Val Trp Lys
100 105 110

Ile Gln Glu Leu Gln Asn Leu Phe Arg Glu Gln Phe Gly Asp Ser Phe
115 120 125

Ser Ile Asp Glu Gly Ile Gly Lys Lys Glu Asn Val Lys Asn Gly Ser
130 135 140

Val Thr Ile Gly Lys Ser Lys Ala Thr Ile Asp Phe Ser Thr Met Lys
145 150 155 160

Leu Ile Asp Cys Asn Ser Asn Pro Leu Lys Gly Arg Val Glu Ser Ile
165 170 175

Leu Ser Ile Gly Gln Lys Leu Thr Thr Pro Leu Cys
180 185



<210> 94 

<211> 567 

<212> DNA 

<213> Saccharomyces cerevisiae
<400> 94 

atggatttac ccaaggacaa aagtgacaga actcatcaaa gaattaatct aaataacagt 60 
gggacagatc gaactaatga tttgtacctt catattgtcc aaacgttcgg ttgcatagaa 120 



acaactgcaa cggaaaatgc cacgaaactg ttaatgctgg gtgacgtcga agtagaaata 180 
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tctgcgagca gcgtttcaat tgagtggaca cagaagtcaa tgataagcca aacaattgcc 240 
gatagtatag taataatgat catcggtttg tgtgcaagcg acaagaacgt gctatctgaa 300 
tcagaattga aagagagaaa ccataacgtt tggaagatcc aagaattgca aaatctgttt 360 
cgagaacaat ttggagacag ttttagcatc gatgaaggaa taggaaaaaa agaaaatgta 420 
aagaatggta gcgtcaccat aggcaagagt aaagccacga tcgatttctc caccatgaag 480 
ctgattgatt gtaattcgaa cccactaaag ggaagagtgg agagcatact aagcattggc 540 
cagaaattaa caactccatt gtgctga 567 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [X] Claims Nos.: 20-28 

because they relate to subject matter not required to be searched by this Authority, namely: 

Rule 39.1(iv) PCT - Diagnostic method practised on the human or animal body (claims 20-25) 
Rule 39.1(iv) PCT - Method for treatment of the human or animal body by therapy (claims 26-28) 

2. [X] Claims Nos.: 11, 20-29, 35, 39 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. [ ] Claims Nos.: 



because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



see additional sheet 



1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 



2. [X] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 



4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

Continuation of Box I.2 
Claims Nos.: 11, 20-29, 35, 39 



Present claims 1-10, 12-19, 30-34, 36-38 and 40 are directed to an 
extremely large number of possible compounds and/or methods/uses. In 
fact, the claims contain so many options, undefined mammalian homologs, 
components of the complex considered similar solely by virtue of their 
encoding DNA hybridising under low stringency conditions, and possible 
permutations that a lack of clarity (and/or conciseness) within the 
meaning of Article 6 PCT arises to such an extent as to render a 
meaningful search of the claims impossible. Consequently, the search has 
been carried out for those parts of the above claims which do appear to 
be clear and/or concise and/or which find support in the description, 
namely the complex(es) indicated in the examples. The same applies, 
mutatis mutandis, to any dependent claims. 

Claim 39 is directed to a generic component of the 
polyadenylation complex obtainable by the generic processes according to 
claims 36-38. Such a product is defined only in these broad terms and has 
therefore not been searched. 

Claims 11, 20-29 and 35 have not been subject to a search. Claims 20-25 
are directed to methods of diagnosis of disease. There are no direct, 
clear and precise teachings pertaining to the use of the complexes and/or 
derivatives thereof in methods of diagnosis. The same applies to the 
subject-matter of claims 11, 26-29 and 35, which is directed to methods 
of treatment of the human and/or animal body and/or pharmaceutical 
compositions. There is no clear teaching pertaining to precise 
indications of disease and modes of therapy, nor to pharmaceutical 
compositions. The skilled person would be unable to ascertain such without 
either undue burden of experimentation or the use of inventive skill. 
Therefore, in the effective absence of such teaching, it is neither 
feasible nor worthwhile to carry out a search for such methods and/or 
compositions. 

Claim 41 refers in a) to SEQ ID numbers which are easily identified as 
individual components and are thus searched. Parts b) and c) of claim 
41 have, however, not been searched, since they respectively refer to 
mammalian homologues/orthologues, functionally active fragments or 
derivatives of the proteins (a) and (b) with, furthermore, undefined single 
or multiple substitutions, deletions and/or additions. These latter have 
no support or teaching (Art. 84 and 83 EPC) and have therefore not been 
searched for similar reasons to those given above. The same applies to 
any dependent claimed subject-matter (claims 41-45). 

The applicant’s attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no International 
Search Report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 






